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ABSTRACT 

Upper  and  lower  bounds  were  determined  for  a  variation  of  Schmidt's  statistic 
using  Imhoff  s  distribution  for  quadratic  forms  in  normal  variables.  This  statistic  is 
able  to  detect  a  fourth  order  autoregressive  disturbance  of  the  form:  et  =  p  j  ct_  j  + 
P4et-4  +  Ht  m  tne  general  linear  model  Y  =  XP  +  £. 

To  correct  for  this  disturbance  and  thus  yield  efficient  regression  estimates,  a 
data  transformation  was  derived  using  the  inverse  of  the  variance-covariance  matrix  as 
defined  by  Siddiqui. 
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I.  INTRODUCTION 

A.       BACKGROUND 

Extensive  work  has  been  accomplished  recently  in  the  area  of  modeling  and 
predicting  quarterly  overhead  costs  for  aircraft  manufacturers.  Overhead  costs  are 
generally  predicted  utilizing  estimated  overhead  rates  applied  to  labor  hours  or  costs 
over  functional  categories  such  as  manufacturing  or  engineering.  Changes  in  operating 
rates  cause  overhead  rate  changes  which  may  be  observed  only  after  a  significant  lag  in 
time.  Recent  work  done  in  this  area  has  been  to  estimate  total  overhead  as  a  function 
of  the  number  of  direct  manufacturing  personnel  [Ref.  1]. 

Data  collected  for  purposes  of  estimating  overhead  costs,  since  it  is  of  a  time 
series  nature,  can  be  expected  to  exhibit  some  degree  of  autocorrelation.  Specifically, 
data  collected  for  recent  purposes  has  exhibited  first  order  autoregressive  AR(1),  fourth 
order  autoregressive  AR(4),  or  a  combination  of  these  processes.  An  AR(1)  process 
occurs  when  the  errors  in  adjacent  time  periods  are  related.  This  type  of  relationship 
would  be  expected  in  yearly  cost  and  operating  data.  A  special  form  of  the  fourth 
order  autoregressive  would  be  expected  in  data  of  a  seasonal  nature,  i.e.  quarterly 
observations.  Errors  in  this  special  case  of  the  AR(4)  process  would  be  related  to 
errors  which  occured  four  quarters  previously. 

The  method  of  ordinary  least  squares  (OLS)  is  used  to  regress  the  data  and 
estimate  overhead  costs.  Ordinary  least  squares'  validity  is  based  on  certain  key 
assumptions: 

•  Errors  are  distributed  independently  of  the  explanatory  variables  with  zero  mean 
and  constant  variance. 

•  Successive  errors  are  independent  of  each  other. 

With  autocorrelation  present,  assumption  two  is  violated  and  the  ordinary  least 
squares  procedure  breaks  down  at  three  points: 

•  Estimates  of  regression  coefficients,  though  unbiased  are  not  efficient. 

•  The  usual  formula  for  the  variance  of  an  estimate  no  longer  applies  and  is  liable 
to  seriously  underestimate  the  true  variance. 

•  t  and  F  distributions,  used  for  making  confidence  statements  are  no  longer  valid 
[Ref.  2]. 

Since  the  OLS  procedure  breaks-down  with  the  presence  of  autocorrelation,  it  is 

important  that  its  presence  be  detected  and  the  disturbance  in  the  data  subsequently 


corrected.  Without  accounting  for  these  disturbances  in  the  data,  estimates  of  the 
regression  coefficients  will  not  have  minimum  variance. 

The  Durbin-Watson  statistic  is  utilized  to  detect  the  presence  of  AR(1)  behavior. 
Additionally,  it  has  been  generalized  to  detect  higher  order  autoregressive  processes. 
Durbin  and  Watson's  work  was  based  on  the  findings  of  T.W.  Anderson  which  showed 
that  the  statistic 

e'Ae  /  e'e,  (eqn  1.1) 

where  e  is  the  column  vector  of  residuals  from  a  regression  and  A  is  a  certain  real 
symmetric  matrix,  provides  a  test  that  is  uniformly  most  powerful  against  certain 
alternative  hypotheses  [Ref.  2]. 

The  general  model  upon  which  Durbin  and  Watson's  work  was  based  is 

Yt  =  Xtp  +  et  (eqn  1.2) 

where 

£t  =  Qtzt_{  +  tir         t=l, T  (eqn  1.3) 

where  Xt  is  a  (1  x  K)  non-stochastic  matrix,  P  is  a  (K  x  1)  vector  and  i  is  one  for  an 
AR(1)  process  and  four  for  an  AR(4)  process  or  a  mixture  of  both  processes.  The 
error  component,  £t,  is  specified  by  the  form  above  and  tk  is  distributed  with  zero 
mean  and  constant  variance. 

Typically  an  ordinary  least  squares  regression  is  performed  on  selected 
independent  variables.  If  the  presence  of  autocorrelation  is  suspected,  the  data  may  be 
tested  for  a  particular  AR  process  utilizing  the  appropriate  form  of  the  Durbin-Watson 
statistic.    For  an  AR(1)  process  the  form  of  the  Durbin-Watson  statistic  is 

7"  i      r     -> 

dl=5>t-et-l>  /E   e  (eqn  1.4) 

i'Z.  **/ 

where 


et  =  Yt 


Y\  (eqn  1.5) 


and 


Yt  =  X$  (eqn  1.6) 

The  estimator  of  P  from  the  ordinary  least  squares  regression  is  p.  The  null  hypothesis 
for  the  test  is  that  of  zero  autocorrelation  in  the  residuals  against  an  alternative 
hypothesis  that  a  first  order  autoregressive  process  exists. 

If  the  presence  of  an  autoregressive  process  is  detected  in  the  regression  residuals, 
the  linear  model  must  be  reestimated  after  transforming  the  original  data.  For  an 
AR(1)  process,  the  independent  variables  are  transformed  as 

Xt*  =  X^l-p^)1/2  t=l  (eqn  1.7) 
and 

Xt*  =  Xt  -  PiXt.j  t=2,...T.  (eqn  1.8) 
In  a  similar  fashion,  the  dependent  variables  are  transformed  as 

Yt*  =  Y^l-pj2)1/2  t=l  (eqn  1.9) 
and 

Yt*  =  Yt-PlYM  t=2,...T.                                                                     (eqn  1.10) 

In  order  to  accomplish  this  transformation  an  estimate  for  pj  is  required. 
Though  this  parameter  can  be  estimated  in  several  ways,  the  most  popular  method  due 
to  simplicity  of  calculation  is 

Pj  =   1  -  .5dj,  (eqn  1.11) 

where  dj  is  given  in  equation  1.4  .  In  addition  to  the  ease  of  calculation,  this  estimator 
performs  as  well  as  more  complex  estimators  which  are  available  [Ref.  3]. 

After  the  data  is  transformed,  generalized  least  squares(GLS)  is  used  to 
reestimate  model  parameters.  After  GLS  is  performed,  the  model  is  rechecked  for  the 
presence  of  autocorrelation  using  the  Durbin-Watson  statistic. 


B.       FOCUS  AND  SCOPE  OF  RESEARCH 

As  previously  mentioned,  regression  data  is  at  times  influenced  by  multiple 
autoregressive  processes.  In  overhead  cost  estimating,  both  an  AR(1)  and  AR(4) 
process  have  been  observed  to  simultaneously  influence  certain  data  sets  [Ref.  1).  In 
the  linear  least  squares  model 

Yt  =  XtP  +  Et,  (eqn  1.12) 

suppose  the  errors  are  serially  correlated  and  of  the  form 

et  =  ejeM  +  e4cM  +  r|r  (eqn  1-13) 

where  the  effects  of  the  second  and  third  prior  quarters  are  negligible  compared  to  the 
effect  of  the  most  recent  and  year-earlier  quarters.  The  t]t's  in  the  model,  as  in  the 
previous  version,  are  independent  and  distributed  with  zero  mean  and  constant 
variance. 

It  is  advised  [Ref.  4:  p.  211]  that  if  this  special  form  of  the  AR(4)  process  exists, 
that  it  be  tested  for  by  a  two-step  procedure.  In  this  procedure,  after  initially 
performing  a  OLS  regression,  the  residuals  are  tested  for  an  AR(1)  disturbance  utilizing 
the  Durbin-Watson  statistic  for  a  first  order  process.  If  the  influence  of  an  AR(1) 
process  is  detected,  a  value  for  pj  is  estimated  and  the  original  data  is  transformed. 
GLS  is  then  performed  on  the  transformed  data  and  the  residuals  are  tested  for  the 
influence  of  a  fourth  order  autoregressive  process  by  means  of  Wallis'  test  [Ref.  5].  If 
an  AR(4)  process  is  present,  a  value  for  p^  is  estimated  and  the  data  is  transformed  a 
second  time.  At  this  point,  the  procedure  is  repeated  to  determine  if  either  form  of 
autocorrelation  is  still  present. 

Unless  the  individual  performing  the  test  has  a  priori  knowledge  that  both  an 
AR(1)  and  AR(4)  process  exist  within  the  data,  it  is  very  likely  that  one  or  the  other 
would  be  overlooked.  Additionally,  peculiarities  in  the  data  may  preclude  the  detection 
of  one  or  the  other  processes.  If  this  occurred,  the  GLS  regression  estimates  as 
previously  discussed  would  be  inefficient.  In  order  to  avoid  the  problem  above,  it 
would  be  desirable  to  have  a  procedure  to  detect  simultaneously  both  an  AR(1)  and 
AR(4)  process  if  they  existed  in  the  data.  This  process,  which  is  a  special  case  of  an 
AR(4)  process  shall  be  specifically  referred  to  as  an  AR(1,4)  process.  In  addition,  if 
the  AR(1,4)  process  exists  then  there  must  be  a  way  to  transform  the  data  and 
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calculate  estimates  for  the  parameters  of  the  process.  This  proposed  procedure  for  the 
AR(1,4)  process  would  prevent  oversights  of  existing  processes  and  would  save  time 
expended  in  having  to  correct  for  the  AR(1)  process  and  then  the  AR(4)  process. 
Most  importantly  though  it  would  insure  that  if  the  AR(1,4)  process  was  present  in  the 
data,  it  would  be  corrected.  In  this  way  any  peculiarities  in  the  data  which  might 
prevent  detection  and  correction  of  either  the  AR(1)  or  AR(4)  process  would  be 
avoided. 

As  previously  discussed,  with  an  autoregressive  disturbance  present  in  the  data, 
the  OLS  assumption  of  independent  error  terms  is  violated.  This  is  related  to  problems 
in  the  error  covariance  matrix  which  is  denoted  as 

E(ee')  =  <r2V|/,  (eqn  1.14) 

where  e  is  a  vector  of  regression  residuals.  The  problem  in  the  error  covariance  matrix 
is  that  there  exist  off  diagonal  terms  not  equal  to  zero.  This  condition  can  indicate 
that  some  form  of  autocorrelation  exists.  An  additional  problem  in  the  error 
covariance  matrix  is  that  the  diagonal  elements  are  not  equal.  This  condition  indicates 
that  the  error  terms  are  not  distributed  with  a  constant  variance.  Errors  distributed  in 
this  manner  cause  estimates  of  regression  coefficients  to  be  inefficient  though  unbiased. 
If  the  error  covariance  matrix  equals 

(T2vj/  (eqn  1.15) 

instead  of 

<r2I  (eqn  1.16) 

A 

where  I  is  an  identity  matrix,  the  regression  coefficient  p  is  correctly  defined  by 
(X'vi/^Xr^'v^Y.  (eqn  1.17) 

A 

The  general  procedure  [Ref.  6:  p.  440]  used  to  attain  an  efficient  estimate  for  p  is: 

•  Find  a  matrix  P  such  that  P'P  =  y    . 

•  Using  the  matrix  P,  transform  the  original  data  set  where 
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Y*  =  PY  (eqn  1.18) 


and 


X*  =  PX.  (eqn  1.19) 


•    Perform  GLS  on  the  transformed  model 


Y*  =  X*p  -f  e*  (eqn  1.20) 


where 


e*  =  Pe  (eqn  1.21) 


to  estimate  the  regression  coefficients  where 

P  =  (X    X  )_1X    Y  .  (eqn  1.22) 


C.       RESEARCH  QUESTIONS 

Following  the  procedure  as  outlined  above,  this  paper  will  address  the  AR(1,4) 
process.    Specifically,  this  thesis  will  attempt  to  develop: 


• 


• 


A  statistic  capable  of  detecting  the  AR(1,4)  disturbance. 

A  P  matrix  for  a  transformation  to  account  for  the  AR(1,4)  disturbance. 

A  method  to  estimate  parameters  required  for  the  transformation. 


D.       RESEARCH  METHODOLOGY 

In  reaching  the  first  objective,  the  statistic  will  be  calculated  by  estimating  the 
distribution  of  the  ratio  of  two  quadratic  forms  in  normal  variables.  This  method  is 
based  on  Imhoffs  [Ref.  7]  technique  and  is  outlined  by  Koerts  and  Abrahamse  [Ref.  8]. 
These  values  shall  be  calculated  for  an  extension  of  Schmidt's  statistic  [Ref.  9]  to  the 
AR(1,4)  process.  This  statistic  was  developed  in  a  manner  similar  to  Durbin  and 
Watson's  and  is  essentially  based  on  their  work.  The  power  of  this  statistic  to  detect 
the  AR(1,4)  process  will  be  compared  to  the  current  sequential  method  of  testing. 
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To  account  for  the  autoregressive  disturbance,  it  is  required  that  there  be  a  P 
matrix  capable  of  transforming  the  data  with  the  property  that 

P'P  =  y"1.  (eqn  1.23) 

The  inverse  of  the  \\f  matrix  for  the  fourth  order  process  was  derived  by  Siddiqui 
[Ref.  10].  With  the  form  of  the  vy  inverse  matrix  known,  a  P  matrix  with  the  above 
stated  property  can  be  found  using  a  method  proposed  by  Beach  and  MacKinnon 
[Ref.  11].  Both  the  \\f  inverse  matrix  and  P  matrix  are  expressed  in  terms  of  0's  or 
autoregressive  parameters  from  the  transfer  function 

e^GjC^j  +  02  ct_2  +  03£t_3  +  ®4C  t-4  +  ^t  (ecin  *-24) 

Utilizing  the  Yule-Walker  equations  [Ref.  12:  p.  55],  the  autoregressive  parameters 
from  this  transfer  function  were  expressed  in  terms  of  the  autocorrelation  coefficients 
or  p's.  Once  the  6's  were  determined  in  terms  of  the  p's,  substitutions  were  made  into 
the  derived  P  matrix.  The  resulting  P  matrix  can  be  used  to  transform  an  AR(4) 
process  as  represented  by  the  error  term  above.  Since  this  error  term  is  not 
representative  of  the  AR(1,4)  model,  corrections  were  made  to  the  P  matrix  by  setting 
appropriate  values  of  p  equal  to  zero.  For  Schmidt's  statistic,  which  will  be  introduced 
in  Chapter  II,  P2  and  p-j  were  set  equal  to  zero.  The  resulting  form  of  the  P  matrix  is 
that  required  to  transform  data  with  an  AR(1,4)  disturbance. 

With  the  P  matrix  defined,  the  next  task  is  to  find  estimates  for  Pj  and  p^.  Since 
the  method  outlined  above  is  generalized  least  squares,  one  approach  for  estimating  the 
values  of  pj  and  p^  is  by  the  sample  correlation  coefficients 

rs  =  X  etet-s'S   et2        s=l'4  (ec*n  L25) 

where  et  is  a  vector  of  least  squares  residuals.  Since  estimators  derived  in  this  manner 
are  subject  to  considerable  small-sample  bias,  the  approach  which  will  be  developed 
will  be  to  estimate  pt  and  p^  by  direct  utilization  of  the  test  statistic.  This  method  is 
analogous  to  Durbin  and  Watson's  method  where 

pj  =   1  -  .5dj  (eqn  1.26) 
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and  di  is  the  Durbin- Watson  statistic  for  an  AR(1)  process. 

After  developing  the  above  outlined  procedure  and  generating  the  bounds  for 
Schmidt's  statistic  a  major  question  which  must  be  answered  is:  How  well  do  the 
procedures  work  in  comparison  to  each  other?  In  order  to  make  the  comparison, 
simulations  will  be  used  which  utilize  least  squares  residuals  generated  for  various 
values  of  pj  and  p^  using  a  formula  for  the  error  of 

ct  =  Plet-1  +  P4ct-4  +  nt  (e<3n  L27) 

where   r\^  is   distributed   normal  with  zero   mean   and   specified   variance   t  .     With 

residuals  calculated  as  such,  the  Schmidt,  Durbin- Watson  and  Wallis  statistics  will  be 

calculated.      In     addition     to     the     Durbin- Watson     statistic,     after     appropriately 

transforming  the  residuals,  the  Wallis  statistic  will  be  used  in  the  two-step  method  to 

test  for  the  presence  of  an  AR(4)  disturbance.    For  various  sets  of  residuals  generated 

with  different  variances  for  the  normal  error  term,  the  power  of  the  statistics  to  detect 

the  AR(1,4)  disturbance  will  be  determined.    Additionally,  the  number  of  observations, 

T,  will  be  varied  by  varying  the  number  of  residual  terms.    Thus  the  power  of  the 

various  statistics  to  detect  the  AR(1,4)  process  will  be  determined  for  different  values 

of  T,  Pj  and  p^  and  t  .    Residuals  used  in  the  above  calculations  will  also  be  used  to 

calculate   estimates    of  Pj    and   p^.     Utilizing    mean    square   error,    calculated    from 

simulation  data,  the  efficiency   of  derived  formulas  to  estimate  P|    and  p^  will  be 

determined.     Specifically,    the    ability    of  both    procedures    to    detect    the    AR(1,4) 

-■) 
disturbance  will  be  determined  for  a  best  case  with  T=  100  and  X    =  .01  and  a  worst 

case  where  T=  20  and  T"6  =   10. 

The  ability  of  the  entire  procedure  derived  for  Schmidt's  statistic  in  estimating  the 

regression  coefficients  will  be  determined  for  a  best  and  worst  case  scenario.    The  best 

case  will  be  for  T-6  =  .01  and  T  =   100,  while  the  worst  case  will  be  for  xL  =   10  and  T 

=  20.   Using  the  regression  model 

Yt  =  pXt  +  £t  (eqn  1.28) 

where 

et  =  Piet-i  +  Pt-4  +  *V  (ec*n  L29) 
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for  Xt  =  .1  and  P  =  1  the  Yt's  are  generated  for  various  values  of  p*  and  p.,.  Once 
the  Yt's  are  determined,  an  estimate  of  P  will  be  calculated.  Mean  square  error  will  be 
used  as  a  measure  to  compare  estimates  of  P  with  their  known  value  of  one.  This 
measure  will  indicate  the  efficiency  of  regression  parameters  estimated  by  the  procedure 
as  based  on  Schmidt's  statistic. 

E.       THESIS  ORGANIZATION 

The  thesis  is  organized  into  four  chapters.  Chapter  II  will  introduce  background 
theory  utilized  to  derive  the  AR(1,4)  procedure.  In  particular,  the  Durbin-Watson  test 
for  the  AR(1)  process  will  be  addressed  along  with  the  Wallis  and  Schmidt  tests  for 
special  cases  of  the  AR(4)  process.  In  addition  to  test  procedures,  the  transformation 
for  autocorrelated  data  will  be  discussed. 

Chapter  III  will  initially  develop  the  test  for  the  AR(1,4)  process.  Specifically, 
upper  and  lower  bounds  will  be  developed  for  the  Boger  and  Schmidt  statistics.  After 
deriving  the  test,  the  data  transformation  for  the  AR(1,4)  process  will  be  addressed. 
Next,  results  will  be  presented  concerning  the  effectiveness  of  the  derived  results  for  the 
AR(1,4)  process.    Finally,  conclusions  and  areas  of  future  study  will  be  discussed. 
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II.  BACKGROUND  THEORY 

A.       GENERAL 

In  this  section  certain  concepts  underlying  all  time  series  processes  will  be 
defined.  These  definitions  will  lay  the  groundwork  which  will  lead  into  discussion  of 
specific  test  procedures.  In  particular,  the  Durbin-Watson,  Wallis  and  Schmidt  tests 
shall  be  discussed.  Discussion  of  these  procedures  will  act  as  a  prelude  to  the  next 
chapter's  development  of  a  test  and  correction  procedure  utilizing  an  extension  of 
Schmidt's  statistic. 

A  sample  of  error  residuals  ej,....e-j-,  where  the  index  denotes  a  point  in  time,  is 
called  a  time  series.  Each  observation  in  this  sample  is  a  realization  of  a  random 
variable  and  as  a  result  this  sequence  is  considered  to  be  a  discrete  stochastic  process. 
The  sequence  is  considered  discrete  since  observations  are  made  at  a  fixed  interval  of 
time.  Additionally,  ej,....ey  is  considered  as  a  finite  subsequence  of  an  infinite  series 
whose  joint  distribution  is  defined  by  these  finite  subsequences. 

Only  stationary  stochastic  processes  will  be  considered  in  this  paper.  A  process  is 
considered  stationary  if  the  joint  probability  distribution  of  the  residuals  ei,....e-r  is 
unaffected  by  shifting  the  time  origin  forward  or  backward  by  k  time  units.  That  is, 
ej,....e-p  and  e^+  i'----ek  +  T'  are  identically  distributed  [Ref.  12]. 

The  general  linear  model  with  autoregressive  disturbance  of  order  p  is  defined  as 

Yt  =  Xt'P  +  et  (eqn2.1) 

with 

ct  =  6let-l  +  -  +  Vt-P  +  nt  (ec*n  2-2> 

where  Yt  is  the  value  of  the  dependent  variable  at  time  t,  Xt'  is  a  (1  x  K)  vector 
representing  an  observation  on  K  non-stochastic  variables  at  time  t  and  P  is  a  (K  x  1) 
vector  of  coefficients  to  be  estimated.  Equation  2.2  implies  that  error  disturbances  in 
the  current  time  period  depend  on  those  in  previous  time  periods  and  another  error 
term  which  is  independent  with  zero  mean  and  constant  variance. 
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Autocovariance  defines  the  linear  relationship  between  members  of  a  stochastic 
sequence.    In  particular,  the  autocovariance,  yk,  between  £t  and  £t  +  k  is  given  by: 

yk  =  E((£t-E(£t))(£t  +  k-E(£t  +  k)))       k=0,l,2,...  (eqn  2.3) 

where  E(£t)  is  the  mean  of  the  stochastic  process,  which  is  zero  in  our  application.  For 
a  stationary  process  the  covariance  between  £t  and  £t_uk  depends  on  k  and  not  on  the 
particular  point  in  time.  The  above  formula  completely  describes  the  autocovariance 
structure  of  the  stochastic  sequence  as  defined  above  [Ref.  13:  p. 226]. 

Autocovariance  is  dependent  on  the  unit  of  measurement  for  the  underlying 
variable.  To  account  for  this,  the  yk's  are  normalized  by  dividing  by  Vq,  which  is  the 
variance  of  £,..  Dividing  yk  by  Jq  results  in  the  autocorrelation  function  of  £t: 

Pk  =  7k  y0-         k=  0,1,2,..  (eqn  2.4) 

From  the  above  definition  of  the  autocorrelation  function  it  is  obvious  that 

P0  =   1-  (eqn  2.5) 

In  order  to  gain  estimates  for  the  0's  or  autoregressive  parameters  in  terms  of  the 
autocorrelation  coefficients  or  p's,  the  Yule-Walker  equation  is  used  [Ref.  12:  p.  55]. 
This  equation  for  the  above  general  linear  model  is  derived  by  left  multiplying  the 
equation 

£t  =  ei£l_1  +  ...  +  ep£t_p-+-Ttt  (eqn  2.6) 

by  Lt,  and  then  taking  expectations  and  dividing  by  the  variance  Jq  of  £r  This 
results  in  the  Yule-Walker  equation 

Pk=eiPk-l  +  "-  +  epPk-p  for  k=  1,2,....  (eqn  2.7) 

By  substituting  k=  1,2,. ..p  into  the  above  equation,  a  set  of  linear  equations  for 
Gj^.-.B  in  terms  of  p  j,p2--PD  are  obtained.  By  substituting  estimates  for  pj^.-Pp 
into  the  above  equations,  the  autoregressive  parameters  are  estimated.    Various  other 
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methods  exist  to  estimate  the  p's,  and  some  will  be  discussed  later.  This  result  is 
important  in  the  specification  of  the  autoregressive  process  and  will  help  in  the 
development  of  a  transformation  matrix  for  the  autocorrelated  data  [Ref.  12]. 

B.       DURBIN- WATSON  TEST 

This  section  summarizes  the  most  important  features  of  the  theory  for  the 
Durbin-Watson  test  [Ref.  2,14,15].  This  theory  is  the  basis  for  later  generalizations  of 
tests  for  autoregressive  disturbances. 

The  Durbin-Watson  test  is  based  on  the  statistic 


j    7* 

d  =  E(et_et-i)  /£et     e'Ae/e'e 


r 

K 

where  e  =  y  -  Xp  is  a  vector  of  least  squares  residuals  and  A  equals: 


(eqn  2.8) 
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For  certain  cases  where  regression  vectors  are  eigenvectors  of  the  matrix  A 
occurring  in  the  residuals  distribution,  the  statistic 


e'Ae  /  e'e 


(eqn  2.9) 


provides  a  test  which  is  uniformly  most  powerful  against  certain  alternative  hypotheses 
[Ref.  16:  p.  88]. 

The  general  linear  model  was  previously  defined  as 


Y  =  Xp  +  £,  (eqn2.10) 

where  Y  is  a  (T  x  1)  matrix,  X  is  a  (T  x  K)  matrix  of  T  observations  on  K  variables 
and  £  is  a  (T  x  1)  matrix  of  errors.  The  least  squares  estimate  of  p  is  P  which  is  given 
by: 

$=  (X'XylX'y.  (eqn2.11) 

Also,  e,  the  vector  of  residuals  from  the  regression,  is  defined  as 

A 

e  =  y-Xp  =  My  (eqn2.12) 

where  I  is  an  identity  matrix  of  order  T,  and 

M  =  I  -XCX'X^X'.  (eqn2.13) 

It  can  be  verified  that  the  matrix  M  is  idempotent.  Substituting  Xp  +  e  for  y  into  the 
formula  for  e  leads  to  the  result 

e  =  Me.  (eqn  2.14) 

This  result  implies  that 

d  =  e'MAMe  /  e'Me.  (eqn  2.15) 

The  distribution  of  the  above  statistic  is  dependent  on  the  (T  -  K)  non-zero  eigenvalues 
of  AM  which  are  denoted  by  8j.  Values  for  6  are  dependent  on  M  through  the  matrix 
X. 

The  matrix  A  is  a  (T  x  T)  symmetric  matrix  with  (T  -  1)  positive  eigenvalues 
which  for  T  greater  than  or  equal  to  three  lie  in  the  closed  interval  from  zero  to  four. 
The  eigenvalues  of  A  will  be  denoted  by  X-  and  are  defined  as 

h  =  2(1  -cos(7tj/T))        j  =   1,....,T-1.  (eqn  2.16) 
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Durbin  and  Watson  showed  that  if  the  et  are  assumed  to  be  normally  distributed, 
there  exists  an  orthogonal  linear  transformation  of  e  to  v  such  that 

d-  IVi    /Ivi  •  (eqn2.17) 

/=/  /»/ 

where  V:  are  independent  and  identically  distributed  normal  variables  with  zero  mean 
and  variance  <y  .  The  distribution  of  d  thus  depends  on  the  6{  which  are  the 
eigenvalues  of  M  [Ref.  2:  p.  412]. 

Using  the  above  result  to  obtain  the  distribution  of  d  is  tedious,  since  the  6-'s 
depend  on  the  X  matrix  and  the  statistic  would  have  to  be  calculated  for  each  new 
matrix.  Consequently,  if  it  is  desired  to  have  a  test  which  is  not  restricted  to  a 
particular  X  matrix,  the  distribution  would  have  to  be  determined  for  all  X  matrices. 
This  task  is  impossible  since  there  are  an  infinite  number  of  matrices. 

Durbin  and  Watson  avoided  this  problem  by  constructing  a  bounds  test  where 
the  upper  and  lower  bounds,  dy  and  di  ,  respectively,  are  independent  of  the  particular 
X  matrix.  They  were  able  to  accomplish  this  by  determining  inequalities  on  the  6  in 
terms  of  the  eigenvalues  X-  of  the  matrix  A,  where  the  S-'s  and  "k-'s  have  been  arranged 
in  increasing  order.   This  result  implies  that 

dL<d<dy  (eqn2.18) 

where 

dU  =  S\+Kvi2/i>i2'  (eqn2.19) 

and 

dL  =  2>ivi  /  £ v  i  •  (^n  2-2°) 

Durbin  and  Watson  were  unable  to  find  the  exact  distribution  of  these  bounds  but 
approximated  the  distribution  with  the  first  four  terms  of  a  series  expansion  in  terms  of 
Jacobi  polynomials  using  a  Beta  distribution  as  a  weight  function. 
Using  the  statistic 

7-  r  . 

K  efet-l  )    /  Iet  ■  (eqn2.21) 
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and  asymptotic  results  it  can  be  shown  that 

d  =  2(  1  -  p  ),  (eqn  2.22) 

where  p  is  an  estimate  of  the  autocorrelation  coefficient.  This  result  shows  that  when 
p  equals  zero,  indicating  no  autocorrelation,  d  equals  two.  In  addition,  when  p  equals 
one  indicating  positive  autocorrelation,  d  equals  zero  and  when  p  equals  negative  one, 
d  equals  four.  In  testing  the  null  hypothesis  of  no  autocorrelation  against  the 
alternative  hypothesis  of  positive  autocorrelation,  the  null  hypothesis  would  be  rejected 
if  the  calculated  value  for  d  is  less  than  the  lower  bound  calculated  and  accepted  if  d  is 
greater  than  the  upper  bound.  If  the  calculated  value  of  d  falls  between  the  upper  and 
lower  bounds  the  test  is  considered  inconclusive  and  there  is  insufficient  evidence  to 
accept  or  reject  the  null  hypothesis. 

Since  Durbin  and  Watson  first  calculated  upper  and  lower  bounds  for  their 
statistic  using  an  approximate  distribution,  a  method  has  been  developed  by  Imhoff 
which  allows  the  calculation  of  an  exact  distribution  for  a  quadratic  form  in  normal 
variables  [Ref.  7:  p.  419].  To  briefly  indicate  how  the  Imhoff  distribution  is  utilized  in 
the  calculation  of  the  statistic's  bounds,  only  the  lower  bound  will  be  considered.  The 
calculation  of  the  upper  bound  can  be  done  in  an  analogous  manner.  In  order  to 
calculate  the  lower  bound,  the  eigenvalues  of  the  A  matrix,  "k-,  are  calculated.  For  the 
A  matrix,  there  will  be  (T  -  1)  non-zero  eigenvalues  where  T  is  the  order  of  the  A 
matrix  and  the  one  zero  term  is  the  eigenvalue  which  corresponds  to  the  constant  term 
of  the  regression.  From  these  eigenvalues,  the  (T  -  K)  smallest  are  selected  to  compute 
the  statistic's  lower  bound  [Ref.  8:  p. 70].  The  value  for  K  is  equal  to  the  number  of 
regressors  including  a  constant  term.  The  basis  for  the  above  procedure  is  the  result 
proved  by  Durbin  and  Watson  which  determined  inequalities  of  the  eigenvalues  6-  of 
the  matrix  AM  in  terms  of  the  eigenvalues  X-  of  the  A  matrix  or 

h-6\-hi+K         ■  -1A-.T-K,  (eqn  2.23) 

where 

K'  =  K  -  1  (eqn  2.24) 
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and  K'  is  the  number  of  regressors  not  including  a  constant  term.  Analogously,  to 
calculate  the  upper  bound  the  (T  -  K)  largest  eigenvalues  are  selected.  Adapting  the 
I  mhofT  distribution  to  this  problem  leads  to 

00 

F(dL)  =  1/2  -  l/7tj  ((  sin  (p(u))  /  (u(o(u»)  du  (eqn  2.25) 

o 

where 

(p(u)  =  .5  £  tan"1(u:u)  (eqn  2.26) 


co(u)  =  n(l  +  UiV)1/4  (eqn  2.27) 

/*/ 

and 


uj  =  Xj  -  dL  (i  =   1, T-K)  (eqn  2.28) 

Since  the  X-'s  are  known,  a  value  for  dr  is  selected  and  the  u-'s  are  calculated. 
Using  the  u-'s,  the  integral  is  then  evaluated  numerically.  Numerical  integration  of  this 
infinite  integral  is  possible  since  I  mhofT  has  proved  that  the  integral's  limit  as  u 
approaches  zero  is 

1/2E  uj.  (eqn  2.29) 

Since  the  integral's  range  is  infinite,  there  will  be  a  truncation  error  associated  with  this 
integration.  ImhofThas  shown  that  the  truncation  error  caused  by  integrating  over  the 
finite  range  from  0  to  u  inclusive,  can  be  held  to  any  arbitrary  level  (i  by  taking 

u  =  (((T  -  K)/2)  TCjljn  \u:\1/2  )8  (eqn  2.30) 

where  g  =  2/(T-K).  Additionally,  the  numerical  integration  will  be  subject  to  error 
related  to  the  method  of  integration.  This  error  is  controlled  by  setting  the  error 
tolerance  within  the  program  used  to  integrate  the  function. 
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C.       EXTENSIONS  TO  HIGHER  ORDER  PROCESSES 

Durbin  and  Watson's  original  work  has  been  extended  to  higher  order 
autoregressive  processes.  Specifically,  Wallis  has  extended  it  to  a  special  case  of  an 
AR(4)  process  and  Schmidt  has  extended  it  to  a  second  order  process.  For  the 
regression  model 

Y  =  XP  +  e,  (eqn2.31) 

with  an  error  specification 

£t  =  p£M  +  Tit,  (eqn  2.32) 

Wallis  generated  bounds  for  the  statistic 

T  T 

d4  =  S(et  "  et-4)     Xet  =  eA4e  '  ee>  (e(ln  2>33) 

Wallis'  derivation  of  this  result  is  analogous  to  Durbin  and  Watson's.  Using 
ImhofTs  distribution,  Wallis  was  able  to  calculate  bounds  for  this  special  case  of  the 
AR(4)  process  [Ref.  5:  p.  617].  Vinod,  in  a  manner  similar  to  Wallis,  further  extended 
these  results  to  include  bounds  for  the  same  special  cases  of  the  AR(2)  and  AR(3) 
processes  [Ref.  17]. 

In  order  to  account  for  an  AR(1)  or  AR(4)  disturbance,  the  data  must  be 
transformed.  In  this  regard,  for  the  AR(1)  process  the  dependent  variables  are 
transformed  as 

Yt*  =  Yt(l  -  Pl2)1/2,         t=l  (eqn  2.34) 

and 

Yt*  =  Y^PjY^,  t=2,3, T.  (eqn  2.35) 

Similarly,  the  independent  variables  are  transformed  as: 

Xt*  =  Xt(l  -  pj2)1  2,         t=l  (eqn  2.36) 
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and 

Xt*  =  Xt  -  ?{Xt_v         t=  2,3, T.  (eqn  2.37) 

In  order  to  utilize  this  transformation,  an  estimate  for  pj  is  required.  Several 
methods  are  available  to  estimate  this  parameter.  The  method  of  choice  due  to  its 
favorable  properties  and  ease  of  computation  [Ref.  3]  is 

Pj  =  1  -  .5dj.  (eqn  2.38) 

For  the  special  case  of  the  AR(4)  process  the  transformation  is  similar. 
Specifically,  the  dependent  variables  are  transformed  as: 

Yt*  =  Yt(l  -  p42)*/2  t=  1,2,3,4  (eqn  2.39) 
and 

Yt*  =  Yt  -  p4Yt.4         t=  5, T.  (eqn  2.40) 

The  independent  variables,  similarly  are  transformed  as: 

Xt*  =  Xt(l  -  P42)1/2  t=  1,2,3,4  (eqn  2.41) 
and 

Xt*  =  Xt  -  p4Xt.4         t=  5, T.  (eqn  2.42) 

As  in  the  AR(1)  case,  an  estimate  of  p4  is  required.  The  formula  for  this  estimate  is 
the  same  as  before  except  that  the  Wallis  statistic  is  used  in  lieu  of  the  Durbin-Watson. 
Once  either  of  these  transformations  is  utilized  after  the  detection  of  its  respective 
disturbance,  estimates  of  regression  parameters  can  be  expected  to  be  efficient  and 
unbiased. 

Schmidt  [Ref.  9]  generalized  Durbin  and  Watson's  preliminary  work  to  the  case 
of  a  second  order  autoregressive  process  whose  disturbance  is  of  the  form: 
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£t  =  pi£m  +  p2£t-2  +  nt 


(eqn  2.43) 


The  statistic  which  Schmidt  derived  to  detect  this  disturbance  was: 


d2  =  d,  +    62 


(eqn  2.44) 


where 


d.^Kve^/ix2 


i*L 


<*i 


and 


r  r 

82  =  2>t  -  et-2>    /  Iet  • 

In  matrix  notation  this  statistic  is  represented  as 


(eqn  2.45) 


(eqn  2.46) 


&2  =  e'(Aj  +  A2)e  /  ee 


(eqn  2.47) 


where  Aj  is  the  matrix  as  previously  defined  for  the  Durbin-Watson  statistic  and  A2  is 
defined  as 
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Following  results  proved  by  Durbin  and  Watson,  Schmidt  generated  the  statistic,  d2, 
utilizing  ImhofTs  distribution  and  the  eigenvalues  from  the  matrix  (Aj  +  A2).  In  a 
manner  similar  to  previous  developments,  he  further  showed  that 

d2  =  2(2  -  Pi  -p2).  (eqn2.48) 


D.       SEQUENTIAL  TESTING 

In  development  of  a  generalization  of  the  Durbin- Watson  statistic,  Vinod, 
derived  a  scheme  of  sequential  tests  for  the  presence  of  autocorrelation  [Ref.  17].  In 
this  procedure  a  sequence  of  tests  is  used  to  determine  if  AR  processes  of  sequentially 
increasing  order  exist.  As  an  example,  the  null  hypotheses  for  the  four  tests  required 
to  detect  a  fourth  order  process  would  be: 

•  H0:pj  =  0, 

•  H0 


H0 
H0 


p2  =  0  given  pj  =  0, 

p3  =  0  given  pj  =  p2  =  0, 

p4  =  0  given  pj  =  p2  =  p3  =  0. 

Thus  in  order  to  test  for  an  autoregressive  disturbance,  the  null  hypotheses  for 
sequentially  greater  orders  are  tested  until  one  of  the  null  hypotheses  is  rejected. 
Specifically,  at  the  j  step,  j  >  1,  the  hypothesis,  Hq  :  P:  =  0  given  that  pj  =  p2  = 
•••  =  Pj-i  =  0  is  tested  against  the  two-sided  alternative  H„  :  p:  >  0  and  p:  <  0. 
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III.  THE  AR(1,4)  PROCESS 

A.       INTRODUCTORY  ANALYSIS 

In  this  section  stationary  conditions  and  the  development  of  the  error  term  for 
the  AR(1,4)  process  will  be  presented.  If  stationary  conditions  do  not  hold  the 
AR(1,4)  model  is  invalid  and  a  model  which  considers  a  dynamic  specification  must  be 
pursued.  In  particular,  stationary  conditions  must  be  determined  for  the  regression 
model 

Yt  =  Xt'p  +  et,  (eqn3.1) 

whose  error  term  is  given  by: 

£t  =  ei£t-l+e4£t-4  +  Tlr  (eqn3.2) 

As  discussed  in  previous  chapters,  the  T|t's  are  assumed  to  be  distributed  normally  with 
zero  mean  and  constant  variance. 

Wise  [Ref.  18]  derived  a  method  to  determine  the  stationary  conditions  for  a 
stochastic  process  of  the  autoregressive  and  moving  average  types.  Following  his 
procedure,  it  is  possible  to  show  that  the  stationarity  conditions  for  a  fourth  order 
autoregressive  process  are 

•  2  +  Pl-p3  +  2p4  >  0, 
3  +  p2  -  3p4  >  0, 
2-Oj  +  p3  +  2p4  >  0, 

1  "Pi  -P2-P3-P4  >  °- 

•  1  +  Pi  -  P2  +  P3  "  P4  >  °- 

Since  p2  and  p^  are  assumed  to  be  zero  in  the  AR(1,4)  process,  these  stationarity 

conditions  reduce  to 

•  2  +  pj  +  2p4  >  0, 

•  3(1  -p4)  >  0, 

•  2-p{  +  2p4  >  0, 

•  I-P1-P4  >  o, 

•  1  +  Pi  -P4  >  °- 
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The  error  process  for  a  general  fourth  order  process  is  defined  by  the  relation: 

ct  =  eiet-l+e2ct-2  +  e3£t-3  +  04ct-4-  (^n  3'3) 

If  the  relationship  is  multiplied  through  by  c^^  and  expectations  are  taken,  the 
difference  equation 

Yk  =  eiYk-l  +  e2Yk-2  +  -  +  04Yk-4        k>°  (Cqn  3>4) 

results,  where  y  was  previously  defined  as  the  autocovariance.  If  this  equation  is 
divided  through  by  Jq,  the  result  is 

Pk=eiPk-l+02Pk-2+-  +  e4Pk-4'  (eqn  3.5) 

which  is  the  general  form  of  the  Yule-Walker  equation  and  defines  the  autoregressive 
parameters,  0's,  in  terms  of  the  autocorrelations,  p's.  If  k=  1,  2,  3,  4,  is  substituted 
into  this  equation  a  set  of  linear  equations  is  obtained  for  8i,  02,  63  and  04  in  terms  of 
Pj,  P2,  P3  and  p^.    Specifically,  these  equations  are 

p1=01  +  02p1  +  03p2  +  04p3,  (eqn  3.6) 


p2  =  01p1+02  +  03p1+04p2,  (eqn  3.7) 

p3  =  01p2  +  02p1+03  +  04p1,  (eqn  3.8) 

and 

P4  =  01P3  +  02P2  +  e3Pl+04-  (eqn  3.9) 

Since  the  error  process  for  the  AR(1,4)  process  is  specified  as 

£t  =  01£t.1+04ct.4  +  T)t,  (eqn  3.10) 
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where  02  and  63  are  assumed  equal  to  zero,  the  above  equations  can  be  reduced  to 

Pi  m  ©1  +  64P3,  (eqn  3.11) 

p2  =  GjP!  +  e4p2,  (eqn  3.12) 

p3  =  0^2  +  94plf  (eqn  3.13) 

and 

P4  =  0^3  +  04-  (eqn  3.14) 

Further  since  it  is  assumed  that  pj  and  P3  are  not  present  in  the  disturbance,  the  above 
equations  can  be  finally  reduced  to 

Pi  -  01  (eqn  3.15) 
and 

P4  =  04-  (eqn  3.16) 
These  final  equations  allow  the  AR(1,4)  error  term  to  be  represented  as: 

ct  =  Plct-1  +  P4ct-4  +  ^t  (eqn  3J7) 


B.       BOGER'S  STATISTIC 

Boger  proposed  a  statistic  which  was  a  variation  of  the  Durbin-Watson  statistic 
to  test  for  the  AR(1,4)  disturbance.   This  statistic  is  of  the  form 

T  T 

di,4=S  (vet-ret-4)  /  Eet  ■  (eqn  3-18) 


In  matrix  notation  this  statistic  is  denoted  as 


djj4=e'Alj4e/  e'e 


(eqn  3.19) 


where  Aj  4  equals 
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Bounds  for  Boger's  statistic  could  not  be  determined  because  its  A  matrix  as 
defined  above  did  not  satisfy  properties  as  established  by  Durbin  and  Watson.  In 
particular,  Durbin  and  Watson  [Ref.  2]  showed  that  the  matrix  product  MA,  where  A 
is  any  real  symmetric  matrix  has  (T-K)  real  positive  eigenvalues  of  which  K  are  zero 
eigenvalues.  T  and  K  are  the  dimensions  of  the  matrix  X.  Specifically,  Durbin  and 
Watson  showed  for  their  statistic  whose  A  matrix  equals 
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that  there  were  T-l  positive  eigenvalues.  As  previously  discussed,  they  also  showed 
that  if  the  positive  eigenvalues  from  both  MA  and  A  are  placed  in  increasing  order, 
bounds  for  statistics  of  the  form 

d  =  e'Ae  /  e'e,  (eqn  3.20) 

can  be  calculated. 

Where  Boger's  statistic  violated  these  properties  was  that  for  all  values  of  K  or 
the  number  of  independent  variables,  there  were  not  sufficient  eigenvalues  in  the  A 
matrix  to  bound  those  eigenvalues  in  the  matrix  MA.  Specifically,  the  A  matrix  for 
Boger's  statistic  had  one  zero  and  three  negative  eigenvalues  where  Durbin  and 
Watson's  had  one  zero  eigenvalue.  Since  the  matrix  MA  had  T-K  positive  eigenvalues 
and  Boger's  A  matrix  only  had  T-4  positive  eigenvalues  instead  of  T-l,  when  the 
number  of  independent  variables  was  less  than  or  equal  to  three,  there  were  not 
sufficient  eigenvalues  from  Boger's  A  matrix  to  bound  those  from  the  matrix  MA. 
Since  this  violated  Durbin  and  Watson's  theory,  upper  and  lower  bounds  could  not  be 
calculated  for  Boger's  statistic,  when  the  number  of  independent  variables  was  less  than 
three. 

C.       SCHMIDT'S  STATISTIC  EXTENDED  TO  THE  AR(1,4)  PROCESS 

Since  it  was  desirable  to  have  a  statistic  which  was  not  limited  by  the  number  of 
independent  variables,  a  variation  of  Schmidt's  statistic  to  fit  the  AR(1,4)  process  was 
investigated.   Schmidt  was  able  to  show  that  the  statistic  d2  where 

d2  =  dj  +  dj  (ecln  3-21) 

and 

h  -  Bet  -  et-2>2  /  Iet2  (^n  3-22> 

provides  a  test  to  detect  a  second  order  autoregressive  disturbance. 

Using  Schmidt's  research  this  paper  proposes  that  it  is  possible  to  detect  the 
AR(1,4)  process  by  way  of  the  statistic  dj  4  where: 

dl,4  =  dl  +  d4  (eqn  3.23) 
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or 


dl,4=f(efet-l)2/fet2  +  f(et-et.4)2/ f et2. 
In  matrix  notation  this  statistic  is  given  as 


(eqn  3.24) 


e'Aj  ^e  /  e'e 
where 


(eqn  3.25) 


A1>4  =  Aj  +  A4 


(eqn  3.26) 


or  A  j  ^  equals: 

"2-1      0     0-1      0     0     0     0     0 
-13-100-10000 

0-13-100-100. 

0      0-1       3-1      0      0-1      0      0 
■1      0      0-1      4-1      0      0-1      0 


-1      0      0-1       3-1 

-1      0      0-1       2 

Results  established  by  ImhofT  and  Durbin  and  Watson  were  used  to  determine 
upper  and  lower  bounds  for  the  statistic  dj  a>  Koerts  and  Abrahamse  [Ref.  8]  have 
provided  Fortran  66  code  which  allows  the  determination  of  the  distribution  of  a 
quadratic  form  in  normal  variables  by  way  of  ImhofTs  distribution.  This  code  was 
updated  to  Fortran  77  standards  and  modified  to  allow  the  calculation  of  the  dj  a 
statistic.  The  algorithm  utilizes  ImhofTs  distribution  to  determine  five  percent 
significance  points  of  the  statistic  by  successively  halving  the  range  of  the  function 
until 


F(dL)  =  .05, 


(eqn  3.27) 


32 


where  F(d^)  is  the  ImhofT  distribution  function  as  defined  in  the  previous  chapter. 
Truncation  and  integration  errors  are  controlled  by  input  parameters.  It  was  found 
that  if  these  errors  were  set  to  less  than  .0001,  the  program  required  inordinate 
amounts  of  CPU  time.  At  a  .0001  level  of  accuracy,  the  five  percent  significance  points 
utilized  S2500  of  computer  resources.  At  a  .00001  level  of  accuracy,  approximately  a 
six  fold  increase  in  resources  can  be  expected.  Five  percent  significance  points  for  the 
statistic  dj  4  are  listed  in  Table  I.  In  Table  I,  K  is  the  number  of  regressors  including 
the  intercept  term  and  T  is  the  number  of  observations. 

Currently,  if  an  AR(1,4)  process  is  suspected  to  exist  in  the  data  a  two-step 
procedure  is  generally  used.  Specifically,  this  involves  first  testing  and  correcting  for 
the  AR(1)  component  of  the  disturbance  and  then  doing  the  same  for  the  AR(4) 
component.  The  AR(1,4)  procedure  proposes  to  test  and  correct  the  AR(1,4) 
disturbance  in  one  step  where  both  components  are  simultaneously  corrected.  Once 
autocorrelation  has  been  determined  to  exist  within  the  data  for  both  procedures, 
values  for  the  autocorrelation  coefficients  Pj  and  p^  must  be  determined.  In  the 
two-step  procedure,  pj  is  calculated  by  way  of  the  formula 

Pj  =   1  -  .5dlf  (eqn  3.28) 

where  dj  is  the  Durbin-Watson  statistic.  After  an  appropriate  transformation,  the  data 
are  tested  in  a  sequential  manner  as  discussed  in  the  previous  chapter  until  a  higher 
order  disturbance  is  detected.  If  a  fourth  order  disturbance  is  detected,  p^  is  estimated 
by  the  formula 

p4  =  1  -  .5d4,  (eqn  3.29) 

where  d^  is  the  statistic  tabulated  by  Wallis.  As  can  be  seen  by  this  procedure,  the 
calculation  of  values  for  pi  and  p^  are  in  a  sense  conditional  upon  each  other.  That  is 
since  p/s  value  is  used  as  part  of  the  original  data  transformation  and  p^  is  determined 
by  way  of  the  Wallis  statistic  which  is  calculated  from  the  transformed  data,  the  value 
of  p^  depends  on  the  value  of  pj. 

There  are  two  important  assumptions  concerning  the  two-step  procedure  as  it 
relates  to  the  AR(1,4)  disturbance.  The  first  of  these  is  that  the  Durbin-Watson 
statistic  will  be  used  to  detect  the  AR(1,4)  process.    That  is,  that  an  individual  who  is 
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TABLE  I 
TABLE  OF  BOUNDS  FOR  THE  SCHMIDT  STATISTIC 


K  -- 

2 

3 

4 

T 

L 

U 

L 

U 

L 

U 

15 

2.38 

2.84 

2.19 

3.05 

2.03 

3.29 

16 

2.43 

2.87 

2.26 

3.06 

2.10 

3.30 

17 

2.48 

2.91 

2.31 

3.10 

2.17 

3.30 

18 

2.53 

2.94 

2.36 

3.13 

2.23 

3.30 

19 

2.57 

2.97 

2.42 

3.15 

2.27 

3.32 

20 

2.61 

3.00 

2.47 

3.16 

2.33 

3.34 

21 

2.65 

3.02 

2.51 

3.18 

2.37 

3.35 

22 

2.68 

3.03 

2.55 

3.20 

2.42 

3.35 

23 

2.72 

3.05 

2.58 

3.22 

2.46 

3.36 

24 

2.75 

3.08 

2.62 

3.24 

2.50 

3.37 

25 

2.78 

3.09 

2.65 

3.26 

2.54 

3.38 

26 

2.80 

3.11 

2.69 

3.26 

2.57 

3.39 

27 

2.82 

3.12 

2.71 

3.28 

2.60 

3.40 

28 

2.85 

3.14 

2.74 

3.29 

2.63 

3.41 

29 

2.87 

3.15 

2.77 

3.30 

2.66 

3.42 

30 

2.90 

3.17 

2.80 

3.31 

2.69 

3.43 

31 

2.92 

3.17 

2.82' 

3.32 

2.72 

3.44 

32 

2.94 

3.19 

2.84 

3.33 

2.74 

3.44 

33 

2.96 

3.20 

2.86 

3.33 

2.77 

3.45 

34 

2.98 

3.21 

2.88 

3.34 

2.79 

3.46 

35 

2.99 

3.22 

2.90 

3.35 

2.81 

3.46 

36 

3.01 

3.23 

2.92 

3.36 

2.83 

3.47 

37 

3.03 

3.24 

2.94 

3.36 

2.85 

3.48 

38 

3.04 

3.25 

2.96 

3.37 

2.87 

3.48 

39 

3.05 

3.26 

2.97 

3.38 

2.89 

3.49 

40 

3.07 

3.27 

2.99 

3.38 

2.91 

3.49 

45 

3.13 

3.31 

3.05 

3.41 

2.98 

3.50 

50 

3.18 

3.34 

3.12 

3.43 

3.05 

3.52 

55 

3.23 

3.37 

3.16 

3.46 

3.10 

3.54 

60 

3.26 

3.40 

3.21 

3.47 

3.15 

3.55 

65 

3.30 

3.42 

3.24 

3.49 

3.19 

3.56 

70 

3.33 

3.44 

3.28 

3.51 

3.23 

3.57 

75 

3.35 

3.46 

3.31 

3.52 

3.26 

3.58 

80 

3.38 

3.47 

3.34 

3.53 

3.30 

3.59 

85 

3.40 

3.49 

3.36 

3.54 

3.32 

3.59 

90 

3.42 

3.50 

3.38 

3.55 

3.34 

3.60 

95 

3.43 

3.51 

3.40 

3.56 

3.36 

3.61 

100 

3.45 

3.53 

3.42 

3.57 

3.38 

3.62 
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TABLE  I 
TABLE  OF  BOUNDS  FOR  THE  SCHMIDT  STATISTIC  (CONT'D.) 


K-- 

5 

6 

T 

L 

U 

L 

U 

15 

1.87 

3.47 

1.73 

3.74 

16 

1.95 

3.47 

1.82 

3.73 

17 

2.01 

3.47 

1.88 

3.72 

18 

2.07 

3.48 

1.95 

3.71 

19 

2.13 

3.49 

2.01 

3.70 

20 

2.19 

3.49 

2.07 

3.68 

21 

2.24 

3.50 

2.12 

3.67 

22 

2.29 

3.51 

2.17 

3.66 

23 

2.33 

3.52 

2.22 

3.66 

24 

2.38 

3.52 

2.27 

3.65 

25 

2.42 

3.52 

2.31 

3.64 

26 

2.45 

3.53 

2.35 

3.64 

27 

2.49 

3.53 

2.39 

3.65 

28 

2.52 

3.54 

2.42 

3.65 

29 

2.56 

3.54 

2.45 

3.65 

30 

2.59 

3.54 

2.49 

3.65 

31 

2.61 

3.55  . 

2.52 

3.65 

32 

2.65 

3.55 

2.55 

3.65 

33 

2.67 

3.55 

2.58 

3.65 

34 

2.70 

3.56 

2.61 

3.65 

35 

2.72 

3.56 

2.63 

3.66 

36 

2.75 

3.57 

2.66 

3.66 

37 

2.77 

3.57 

2.68 

3.66 

38 

2.79 

3.57 

2.70 

3.66 

39 

2.80 

3.57 

2.72 

3.66 

40 

2.83 

3.58 

2.74 

3.66 

45 

2.91 

3.59 

2.84 

3.66 

50 

2.98 

3.60 

2.92 

3.67 

55 

3.05 

3.62 

2.99 

3.68 

60 

3.10 

3.62 

3.04 

3.69 

65 

3.15 

3.63 

3.09 

3.69 

70 

3.18 

3.63 

3.13 

3.69 

75 

3.22 

3.64 

3.18 

3.69 

80 

3.25 

3.64 

3.21 

3.70 

85 

3.28 

3.65 

3.24 

3.70 

90 

3.30 

3.66 

3.27 

3.71 

95 

3.33 

3.66 

3.29 

3.71 

100 

3.35 

3.66 

3.32 

3.71 
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testing  for  the  AR(1,4)  process  will  not  calculate  the  Wallis  statistic  if  the 
Durbin- Watson  statistic  fails  to  detect.  This  assumption  should  be  fairly  accurate  in 
practice  as  an  individual  would  most  likely  decide  that  autocorrelation  is  not  present  if 
the  Durbin- Watson  statistic  indicated  such.  The  second  assumption  which  is  closely 
related  to  the  first,  concerns  the  order  of  calculation  for  Pj  and  p^.  Specifically,  as 
previously  shown,  the  formula  derived  from  the  Schmidt  statistic  contains  a  pi  and  p^ 
term.  Since  this  is  the  case,  different  results  might  have  been  obtained  if  p^  had  been 
calculated  by  way  of  the  Wallis  statistic  formula  first,  after  which  p  j  would  have  been 
calculated  by  the  Schmidt  statistic  formula.  This  possibility  was  not  investigated  in 
this  paper,  but  pj  was  calculated  by  way  of  Durbin  and  Watson's  statistic  after  which 
P4  was  determined  from  the  Schmidt  formula. 

For  the  AR(1,4)  procedure  values  for  pj  and  p^  will  be  calculated  in  a  similar 
manner.   Specifically,  it  is  proposed  that  P|  will  be  calculated  by  the  formula 

pj  =   1  -  .5dj  (eqn  3.30) 

and  P4  by  the  formula 

p4  =  2  -  pj  -  .5dlj4.  (eqn  3.31) 

The  formula  for  p4  is  derived  from  the  original  definition  of  dj  4  or 

di,4  -  (I  (vet-i)2  +  £ (vet-4)2)  /  5>t2-  (ec*n  3-32> 

4*2.  i-S  *•/ 

If  the  formula  for  dj  4  is  factored  and  asymptotic  arguments  applied,  after  appropriate 
rearrangement  of  terms,  the  formula  for  p4  is  achieved.  The  validity  of  these  formulas 
is  argued  in  a  manner  similar  to  that  for  the  two-step  procedure.  The  difference  is  that 
values  for  P|  and  P4  are  determined  in  a  single  step.  An  advantage  to  this  method  is 
that  a  value  for  P4  is  calculated  prior  to  the  data  transformation,  thus  its  value  will  not 
be  influenced  by  any  peculiarities  which  may  have  occurred  during  the  data 
transformation. 

Once  estimates  for  Pj  and  P4  have  been  determined,  a  method  to  utilize  them  in 
correcting  the  AR(1,4)  disturbance  must  now  be  determined.  As  previously  discussed, 
this  requires  that  a  matrix  P  be  found  such  that  P'P  =  xj/"1  where  y     is  the  inverse  of 
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the  sample  covariance  matrix.    Once  the  P  matrix  is  determined,  the  data  can  be 
transformed  such  that 


Y    =  PY, 


(eqn  3.33) 


X     =  PX, 


(eqn  3.34) 


e     =  Pe, 


(eqn  3.35) 


and  least  squares  applied  to  the  model 


Y    =  X  P  +  e  , 


(eqn  3.36) 


where  the  estimator  for  P  is 


P  =  (X'P'PX^X'P'PY. 


(eqn  3.37) 


If  p  is  determined  from  the  transformed  data,  it  should  be  more  efficient  than  estimates 
from  ordinary  least  squares. 

Beach    and    MacKinnon    [Ref.  11],    in    their    paper    on    maximum    likelihood 
estimation  of  a  second  order  process,  defined  the  P  matrix  as 


a 0 

be 0 

?2  -Pi     * ° 

P -P2  -Pi  1 
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They  proposed  a  secondary  method  to  determine  values  for  a,  b  and  c  where  P'P  is 
constrained  to  be  proportional  to  the  inverse  of  the  variance-covariance  matrix  as 
given  be  Siddiqui  [Ref.  10]  and  a,  b  and  c  are  solved  for  in  the  implied  restrictions.  In 
a  manner  similar  to  that  proposed  by  Beach  and  MacKinnon,  this  paper  will  solve  for 
the  P  matrix  of  the  fourth  order  process. 

From  Siddiqui's  paper  on  the  inversion  of  the  sample  covariance  matrix  it  was 
possible  to  determine  that  the  inverse  of  the  sample  covariance  matrix,  V|/    ,  equals 
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where  a  ^  =  p  j  and  a^=p^.  This  matrix  was  obtained  from  the  \\i  matrix  for  a  fourth 
order  process  after  setting  the  terms  for  a^  and  a-i  equal  to  zero,  where  a9  =  P7  and 
a3  =  p3. 

The  P  matrix  for  the  AR(1,4)  process  is  denned  such  that  P  equals: 
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be... 
d  e  f  .  . 
g  h  i  j  . 
P4    0     0    -Pl    1 


-P4   o     o    -Pl    1 


By  following  Siddiqui's  method  in  the  context  as  proposed  by  Beach  and  MacKinnon, 
10  equations  in  10  unknown  are  defined 


a2  +  b2  +  d2  +  g2  =   1  -  p42, 


(eqn  3.38) 


be  +  de  +  gh  =  -pj, 


(eqn  3.39) 


fd  +  ig  =  0, 


(eqn  3.40) 


gj  =  -P1P4, 


(eqn  3.41) 


c2  4-  e2  +  h2  =   1  +  pj2  -  p42, 


(eqn  3.42) 


fe  +  hi  =  -pj, 


(eqn  3.43) 
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jh  =  0,  (eqn  3.44) 


f2  +  i2  =   1  +  P!2-p42,  (eqn  3.45) 


ij  =  -Pp  (eqn  3.46) 


j2  =   1  -  p42.  (eqn  3.47) 

Solving  these  equations  for  the  10  unknowns,  the  elements  of  the  previously  defined  P 
matrix  are  determined  where 


2     u2     A2     „2 


a  =  V(l  -  P4    -  bz  -  dz  -  gz),  (eqn  3.48) 


b  =  (-pj  -  gh  -de)  /  c,  (eqn  3.49) 


c  =  V  (1  +  Pi    -  P4    -  hz  -  ez),  (eqn  3.50) 


2<l    n  2n  2, 


d  =  (-P!zp4V(i  -  p4z))/(d  -  p4z)  v  (d  -  p4z)z  -  P!  W»'  (ecin  3-51> 


e  =  (-PiV  (1  -  P4  ))  /  (V  (d  -  P4  f  -  Pi  P4  )),  (eqn  3.52) 


f  =  V  (((1  -  P4Z)Z  -  PjZP4Z)  /  (1  -  P4Z)),  (eqn  3.53) 
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g  =  -P1P4  /  V  (1  -P4  ),  (eqn3.54) 


h  =  0,  (eqn  3.55) 


i  =  -Pi  /  V(l  -P42),  (eqn  3.56) 


]  =  V  (1  -  P4  )•  (eqn  3.57) 

The  previously  defined  matrix  with  substituted  values  as  calculated  above  satisfies  the 
required  property  that  P'P  =  i|/    . 

D.      EFFECTIVENESS  OF  THE  AR(1,4)  PROCEDURE 

As  with  any  new  procedure,  it  is  important  that  its  effectiveness  relative  to  older 
procedures  be  determined.  In  this  case  the  AR(1,4)  procedure  will  be  compared  to  the 
two-step  procedure.    Specifically,  these  procedures  shall  be  compared  in  three  areas: 

•  Accuracy  of  estimate  for  p^, 

•  Ability  to  detect  the  AR(1,4)  disturbance 

•  Accuracy  of  estimates  for  regression  parameters 

Two  simulation  programs  written  in  APL  were  utilized  to  generate  data  for  the 
analysis  required  to  make  the  comparison.  These  programs  are  displayed  in  the 
appendix.  Program  Stats  is  used  to  generate  data  for  the  comparison  of  the  estimates 
of  P4  and  the  ability  to  detect  the  AR(1,4)  disturbance.  This  program  utilizes  the  APL 
programs  Norrand  and  Unirand  for  the  generation  of  normally  distributed  random 
numbers.  Norrand  and  Unirand  are  generic  APL  random  number  generators  on  the 
IBM  370/3033AP  computer  system  which  was  used  to  perform  the  simulations.  After 
the  program  generates  regression  residuals,  the  Durbin-Watson,  Wallis  and  Schmidt 
statistics  are  calculated.  The  Durbin-Watson  and  Schmidt  statistics  are  then  compared 
to  their  respective  bounds  for  the  selected  number  of  observations  and  independent 
variables.  If  the  statistic  detects  the  disturbance,  a  counter  will  be  incremented.  In 
addition  to  determining  whether  a  detection  is  made  or  not,  the  statistics  are  used  in 
previously  discussed  formulas  to  calculate  values  for  pj  and  p^. 
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Program  Regress  is  the  second  simulation  used.  Its  specific  purpose  is  to 
generate  data  which  will  allow  the  comparison  of  regression  parameters  from  both  the 
two-step  and  AR(1,4)  procedures.  This  program  as  does  the  previous,  utilizes  the 
Norrand  and  Unirand  routines  to  generate  normally  distributed  random  numbers.  In 
the  program,  values  for  one  independent  variable  and  intercept  term  were  generated  for 
a  predetermined  number  of  observations,  from  a  normal  distribution  with  zero  mean 
and  a  variance  of  .0625.  After  generating  the  error  term  from  the  AR(l,4)  model  for 
specified  values  of  Pj  and  p^,  the  dependent  variables  are  determined  by  way  of  the 
general  linear  model  with  a  value  for  P  of  one.  Using  this  data,  estimates  of  P  are 
determined  by  way  of  ordinary  least  squares,  the  two-step  procedure  and  the  AR(1,4) 
procedure. 

As  previously  discussed  in  Chapter  I,  it  was  decided  to  determine  how  well  the 
AR(1,4)  procedure  performs  relative  to  the  two-step  procedure  for  a  best  case  and 
worst  case  scenario.  For  the  best  case  scenario,  the  number  of  observations  was  100 
and  the  error  term  was  generated  from  a  normal  distribution  with  zero  mean  and  a 
variance  of  .01.  For  the  worst  case  the  number  of  observations  was  20  and  the 
variance  of  the  normal  distribution  was  10.  For  both  the  best  and  worst  case 
scenarios,  200  replications  were  performed  for  17  combinations  of  pi  and  p^.  The 
number  of  combinations  which  could  be  investigated  was  limited  by  the  initial  terms  of 
the  P  matrix  which  contained  square  roots.  The  problem  was  that  for  certain 
combinations  of  pi  and  p^  the  term  under  the  square  root  became  negative.  The  17 
combinations  of  Pj  and  p^  which  were  investigated  were 


1  Pi 

= 

P4  = 

.1 

2  P! 

= 

P4  = 

.3 

3  P! 
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P4  = 

.5 

4  Pt 
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.7 

5  P! 
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P4  = 

.9 

6  Pl 
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P4  = 

.1 

7  Pl 
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9  Pl 
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10  pj 
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12  pj 
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13  pj 
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.1 
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14  Pl  =  .7       p4  =  .3 

15  pL  =  .7       p4  =  .5 

16  Pl  =  .9       p4  =  .i 

17  p,  -  .9       p4=  .3. 

For  a  given  value  of  pj,  any  value  of  p4  greater  than  the  displayed  values  for  p4 
caused  the  problem  with  terms  under  square  roots  in  the  P  matrix.  In  subsequent 
graphs  used  to  display  results,  each  combination  of  pj  and  p4  will  be  referred  to  by  the 
number  as  assigned  above.  Thus  combination  one  is  given  where  pi  =  .1  and  p4  = 
.1.  The  effectiveness  of  the  procedure  in  handling  negative  autocorrelation  was  not 
investigated. 

Mean  square  error  (MSE)  was  selected  to  be  the  measure  by  which  the  best 
procedure  would  be  selected.  In  the  comparison  of  P  s  from  both  procedures,  MSE 
was  estimated  separately  for  the  slope  and  intercept  terms  utilizing  the  formula 

MSE  =  £  (P  "  u)2/n  (eqn  3.58) 

where  u  was  the  actual  value  of  the  slope  or  intercept  and  n  was  the  number  of 
simulation  replications.  Specifically,  the  value  of  u  was  unity  for  this  application.  In  a 
similar  manner,  the  MSE  for  the  vector  P  consisting  of  the  slope  and  intercept  term 
was  determined  via  the  formula 


MSE  =  V  {(jf0  .  ixPj  .  i))2/n  (eqn  3.59) 

where  Pq  is  the  intercept  term  and  P|  is  the  slope  term.    Mean  square  error  for  the 
comparison  of  p^'s  was  calculated  in  a  similar  manner. 

Power  curves  were  constructed  to  allow  the  determination  of  which  procedure 
could  best  detect  the  AR(1,4)  disturbance.  These  curves  were  constructed  by  plotting 
for  each  piand  p4  combination  the  number  of  times  each  procedure  detected  the 
disturbance  out  of  100  possible  attempts.  As  previously  mentioned,  if  a  trial  turned 
out  to  bt  inconclusive,  it  was  not  counted  as  a  detection.  The  number  of  times  for 
each  Pj,p4  combination  that  the  procedures  were  inconclusive  was  plotted  to  enable 
readers  who  consider  inconclusive  trials  as  detections  to  reevaluate  the  power  curves. 
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E.       RESULTS 

Comparisons  of  mean  square  error  for  estimates  of  p^  are  shown  in  Figures  3.1 
and  3.2.  Figure  3.1  displays  mean  square  errors  for  the  worst  case  scenario  while 
Figure  3.2  displays  MSEs  for  the  best  case  situation.  Mean  squared  errors  for  both  the 
best  and  worst  case  scenario  are  consistent  for  both  the  AR(1,4)  and  two-step 
procedure  estimates.  For  the  best  case  scenario  the  two-step  estimate  is  clearly 
superior  for  Pj.p^  combinations  greater  than  nine.  MSEs  for  two-step  and  AR(1,4) 
procedure  estimators  for  the  worst  case  scenario  diverge  for  Pj,p^  combinations  greater 
than  15.  This  indicates  that  for  the  worst  case  scenario,  the  AR(1,4)  procedure 
estimate  for  p^  can  be  expected  to  perform  as  well  as  the  two-step  estimate. 

As  previously  discussed,  an  individual  using  the  two-step  procedure  to  test  for  an 
AR(1,4)  disturbance  might  initially  use  the  Durbin-Watson  statistic.  If  the  test 
indicated  that  autocorrelation  did  not  exist,  further  testing  with  the  Wallis  statistic 
might  not  be  considered.  Since  this  was  the  case  assumed,  power  curves  were  created 
which  compared  the  Schmidt  and  Durbin-Watson  statistics.  The  Schmidt  statistic  is 
considered  as  part  of  the  AR(1,4)  procedure  while  the  Durbin-Watson  is  part  of  the 
two-step  procedure.  For  the  best  case  scenario  with  T  =  100  and  t  =  .01,  shown  in 
Figure  3.4,  the  Schmidt  statistic  except  for  the  sixth  combination  is  more  powerful  than 
the  Durbin-Watson  statistic.  For  this  scenario,  both  statistics  are  equally  powerful  for 
Pj,p^  combinations  greater  than  10.  For  the  worst  case  situation,  shown  in  Figure  3.3, 
both  statistics  are  equally  capable  across  all  combinations  in  detecting  the  AR(1,4) 
disturbance.  As  previously  discussed  the  power  curves  for  both  procedures  do  not 
include  inconclusive  test  results.  Figures  3.5  and  3.6  display  the  number  of  times  that 
the  test  statistics  were  inconclusive  for  the  worst  and  best  case  scenarios  respectively. 

Figures   3.7   through   3.12   show   comparisons   of  mean   square   errors   for  the 

regression  parameters  of  the  two-step  and  AR(1,4)  procedures.    Figures  3.7  and  3.8 

show   MSEs  calculated  using  both  Pq  and  P|   for  worst  and  best  case   situations, 

respectively.    For  the  worst  case  with  T  =  20,  x     =   10  scenario,  shown  in  Figure  3.7, 
/s 
MSEs  for  both  procedures  follow  similar  trends  which  are  consistent  with  each  other. 

Figures  3.9  and  3.10  which  display   MSEs  for  the  intercept  and  slope  coefficients, 

respectively,  show  trends  which  support  the  previous  findings  of  Figure  3.7.    Mean 

square  errors  for  the  best  case  scenario  are  shown  in  Figures  3.8,  3.11  and  3.12.    As 

shown  in  Figure  3.8,  both  procedures  follow  similar  trends  for  p^,P4  combinations  up 

to  15.    Clearly  the  two-step  procedure  diverges  at  the  p^p^  combination  15.    Figures 
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Figure  3.1     MSE  Companson  of  Estimators  for  p^  Worst  Case 
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Figure  3.2     MSE  Comparison  of  Estimators  for  p^  Best  Case 
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Figure  3.3     Power  of  Statistics  Worst  Case 
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Figure  3.4     Power  of  Statistics  Best  Case 
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Figure  3.5     Inconclusiveness  of  Statistics  Worst  Case 
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3.11  and  3.12  indicate  that  this  divergence  is  related  to  a  divergence  of  the  intercept 
estimate  Tor  the  two-step  procedure.  Overall,  it  appears  that  regression  parameters 
from  both  procedures  are  consistent  with  each  other  and  that  one  procedure  is  not 
superior  to  the  other  with  regards  to  the  estimation  of  regression  parameters. 

F.  CONCLUSIONS  AND  RECOMMENDATIONS 

Overall,  the  AR(1,4)  procedure  as  previously  developed  is  just  as  capable  as  the 
two-step  procedure  in  correcting  the  AR(1,4)  disturbance  for  the  PpP^  combinations 
for  which  it  can  be  used.  The  Schmidt  statistic  as  part  of  this  procedure  is  effective  in 
detecting  the  AR(1,4)  disturbance.  In  certain  situations,  its  abilities  surpass  those  of 
the  Durbin-Watson  statistic  from  the  two-step  procedure.  In  particular,  its  capabilities 
are  noteworthy  at  higher  values  of  T,  lower  values  oft  and  lower  pi,p^  combinations. 
It  appears  from  the  results  that  estimates  of  p^  calculated  via  the  two-step  procedure 
are  better  than  estimates  from  the  AR(1,4)  procedure,  at  least  for  the  order  of 
calculation  of  p^  which  was  tested.  If  an  individual  suspects  that  an  AR(1,4) 
disturbance  is  present  in  the  data  then  it  is  recommended  that  the  Schmidt, 
Durbin-Watson,  and  Wallis  statistics  be  calculated.  The  Schmidt  statistic  should  be 
utilized  to  detect  the  disturbance  while  the  Durbin-Watson  and  Wallis  statistics  are 
used  to  calculate  values  of  pi  and  p^.  If  the  values  for  pi  and  p^  are  acceptable,  then 
they  should  be  used  in  the  AR(1,4)  P  matrix  to  correct  for  the  disturbance.  If  the  Pj 
and  P4  values  fall  out  of  the  acceptable  range  for  the  AR(1,4)  procedure  then  the 
two-step  procedure  must  be  used. 

G.  AREAS  OF  FUTURE  RESEARCH 

Three  areas  of  future  research  are  suggested  as  a  result  of  this  research.  The  first 
of  these  would  be  to  find  a  P  matrix  for  the  AR(1,4)  procedure  which  was  not  limited 
by  values  of  pi  and  p^.  A  second  area  of  research  would  be  to  derive  a  correction 
procedure  for  a  general  fourth  order  autoregressive  process.  This  derivation  would 
follow  closely  what  was  done  above  for  the  AR(1,4)  process. 

A  final  area  of  research  would  be  to  investigate  the  maximum  likelihood 
estimation  of  a  general  AR(4)  or  an  AR(1,4)  process.  Beach  and  MacKinnon  [Ref.  11] 
have  investigated  the  second  order  process  and  suggest  possible  ways  to  extend  this 
procedure  to  the  general  fourth  order  case. 
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Figure  3.7    Comparison  of  Two-step  and  AR(1,4)  Procedure's  MSEs  Worst  Case 
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Figure  3.8     Comparison  of  Two-step  and  AR(1,4)  Procedure's  MSEs  Best  Case 
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Figure  3.9     Comparison  of  Two-step  and  AR(1,4)  Procedure's  Intercept  MSEs  Worst  Case 
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Figure  3.10    Comparison  of  Two-step  and  AR(1,4)  Procedure's  Slope  MSEs  Worst  Case 
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Figure  3.11     Comparison  of  Two-step  and  AR(1,4)  Procedure's  Intercept  MSEs  Best  Case 
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Figure  3.12     Comparison  of  Two-step  and  AR(1,4)  Procedure's  Slope  MSEs  Best  Case 
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APPENDIX 
PROGRAM  LISTINGS 

C    THIS  PROGRAM  COMPUTES  THE  LOWER  BOUND  FOR  THE  SCHMIDT  STATISTIC 

C    USING  IMHOFF'S  INTEGRAL.   THE  FILE  MAT  IS  A  FILE  OF  EIGENVALUES 

C    OBTAINED  FROM  THE  SUBROUTINE  LATRV.  THE  USER  IS  REQUIRED  TO  SET 

C    THE  NUMBER  OF  OBSERVATIONS  AND  THE  NUMBER  OF  INDEPENDENT  VARIABLES. 

C    ADDITIONALLY,  A  STARTING  FOR  XKRITL  SHOULD  BE  SUPPLIED. 

C    THE  PROGRAM  WILL  PRINT  VALUES  FOR  THE  INTERGRAL  AND  XKRITL 

C    AS  THE  PROGRAM  EXECUTES.   IN  THIS  WAY  THE  USER  IS  ABLE  TO 

C    SEE  THE  HOW  WELL  THE  INTEGRAL  IS  CONVERGING.   IF  THE  INTEGRAL 

C    IS  CONVERGING  TOO  SLOWLY  THEN  THE  VALUE  OF  XKRITL  SHOULD  BE 

C    CHANGED 

REAL  Af 100. 100) .RC 10000) ,V( 10000], D(100),L( 100), U( 100), S( 100) 

REAL  EPSl,EPS2,EPS3+FD,XKRlTL,XKRITU 

INTEGER  IND.N.lS.K.t.Nft.KP 

open(unit=io,file='mAt1} 

OPENf UNIT=12  FILE='OUT') 

XKRITL=0. 

XKRITL=3.40 
C         ASSIGN  VALUE  TO  ORDER  OF  MATRIX 

N=100 
C      SET  NUMBER  OF  INDEPENDENT  VARIABLES  (INCLUDE  INTERCEPT  TERM 

K=6 

T  Q  —  Kl  *  M 

C       "  SET  DESIRED  TRUNCATION  ERROR 

EPS1=. 0001 
C  SET  DESIRED  ACCURACY  OF  NUMERICAL  INTEGRATION 

EPS2=. 0001 
C  SET  DESIRED  ACCURACY  OF  DISTRIBUTION  FUNCTION 

EPS3=. 001 
C  SET  VALUE  OF  DISTRIBUTION  FUNCTION 

C     FD=. 05 
C  SET  INPUT  CODE 

IND=1 
C      ADJUST  FOR  THE  NUMBER  OF  COLUMNS  OF  THE  MATRICES  'A'  AND 
C         YX'  WHICH  ARE  LINEAR  COMBINATIONS  OF  EACH  OTHER 

T=N-K 
C  GENERATE  THE  'A'  MATRIX 

CALL  SCHMDT(A.N) 

WRITEflO  *)((A(1,J),  J=1,N),  1=1, N) 

REWIND  16 
C  READ  IN  'A*  MATRIX  FOR  WHICH  EIGENVALUE  AND  EIGENVECTORS 

C  WILL  BE  COMPUTED 

READ  (10. M  (R(I).  1=1, IS) 

c       commute  the  Eigenvalues  and  eigenvectors  of  matrix  'a1 

CALL  LATRV(R.N.V,D.IND) 
WRITE(12,*)(D(1)  1=1, N) 
40  REWIND  12 

READ(12  *J(D(I).  1=1. N) 

C  SEf  THE  NUMBER  dF  NON-ZERO  EIGENVALUES 

NR=T 
C  SORT  THE  N-K  SMALLEST  EIGENVALUES 

KP=K-1 

DO  10  J=1,T 

L(J)=D(J+KP) 
10  CONTINUE 
C         COMPUTE  LOWER  BOUND  FOR  STATISTIC 

CALL  FQUAD(L,NR_,FD,EPS1AEPS2, XKRITL) 

XKRITD=ABS(XKRltL-XKRIT6) 

XKRITO=XKRITL 

IF((ABS(FD-.05)-EPS3).  LE.  0)  GO  TO  30 

IF((FD-.05).LT.0)  THEN 
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XKRITL=XKRITL+. 5*XKRITD 

PRINT  *AFD,XKRITL 

GO  TO  40 

END  IF 

IF[(FD-.05).GT.  0)  THEN 

XKRITL=XKRITL-.5*XKRITD 

PRINT  *.FD,XKRITL 

GO  TO  46 

END  IF 
30  PRINT  *,FD,XKRITL 

STOP 

END 
C  SUBROUTINE  LATRV 

C 

C     PURPOSE:  FIND  THE  EIGENVALUES  AND  VECTORS  OF  A  REAL  SYMMETRIC 
C     MATRIX 
C 

C     DESCRIPTION  OF  VARIABLES: 

C         R=  A  REAL  SYMMETRIC  MATRIX,  DESTROYED  DURING  COMPUTATIONS 
C  AND  REPLACED  BY  THE  TRANSPOSED  MATRIX  OF  EIGENVALUES 

C         N=  ORDER  OF  THE  MATRICES 
C         V=  MATRIX  OF  EIGENVECTORS 

C         D=  VECTOR  OF  EIGENVALUES  IN  DESCENDING  ORDER 
C       IND=  INPUT  CODE 

IF  IND=0  EIGENVALUES  AND  EIGENVECTORS  ARE 
C  COMPUTED 

C  IF  IND=1  EIGENVALUES  ARE  COMPUTED  ONLY 

C 

C      METHOD:  DIAGONALIZATION  METHOD  OF  JACOBI 
C 

REFERENCE:  NATIONAL  PHYSICAL  LABORATORIES,  'MODERN  COMPUTING 
C  METHODS^ 

C 

c 

SUBROUTINE  LATRV  (R,N,V.D, IND) 
DIMENSION  R(l),V(l),D(l) 
INDEX(I.J)=WJ-lrft 
C         GENERATE  IDENTITY  MATRIX 
IFf IND)  1,1,4 

1  NN=N*N 

IJ=N+1 

DO  2  1=1, NN 

2  V(I)=0 

DO  3  I=1,NN,IJ 

3  V(I)=1 

C        COMPUTE  FINAL  NORM  EPS 

4  EPS=0 

DO  6  1=1, N 
DO  6  J=1,N 
IJ=INDEXfj.n 
IF(I-J)  5,6,5 

5  EPS=EPS+R(1J)**2 

6  CONTINUE 

IF(EPS-.OOOOOOl)  95.95,7 

7  EPS=SQRT(EPS)*.000006l/N 

DREMP=1 

8  L2=0 

DO  80  LR=1,N 
IR=INDEX(O.LR) 
DO  80  LK=Lft,N 
IFfLK-LRl  80,80,10 
10  IK=INDEX(6,LK) 
K1=LR+IK 
K2=LR+IR 
K3=LK+IK 

IFfABS(ARK)-DREMP)  80,80,20 
20  L2=l 
C  COMPUTE  SIN  AND  COS 
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ARR=R(K2) 

AKK=R(K3) 

XL=2*ARK 

XM=ARR-AKK 

SQ=SQRT(XL*XL+XM*XM) 

STN2T=XL/SQ 

IF(XM)  25.26,26 

25  SIN2T=-SIN2T 

26  C0ST=SQRTfXl+ABSjfXM)/SQ)/2) 

SINT=STN2T/(2*C0ST) 

SINT2=SINT**2 

C0ST2=C0ST**2 

MODIFY  MATRICES 
DO  40  J=1,N 
K4=J+IR 
K5=J+IK 
K6=INDEX| 
K7=INDEX| 
ARJ=R(K4 
AKJ=R(K5' 

R(K4)=ARJ*C0ST+AKJ*SINT 
R(K5)=AKJ*C0ST-ARJ*SINT 


R(K6)=R(K4) 
0 


R(K7)=R(K5' 
40  CONTINUE   ' 

K7=LK+IR 

R(K1)=0 

R(K7)=0 

R(K2)=ARR*COST2+AKK*SINT2+ARK*SIN2T 

R(K3)=ARR*SINT2+AKK*COST2-ARK*SIN2T 

IRIND)   45,45,80 
45     DO  50   1=1, N 

K1=I+IR 

K2=I+IK 

ARR=V(K1) 

V(K1)=C0ST*ARR+SINT*AKK 
50     V(K2)=COST*AKK-SINT*ARR 
80     CONTINUE 

IF(L21  8,90,8 
90     DREMP=DREMP*: 1 
C  COMPARE  THRESHOLD  DREMP  WITH  NORM  EPS 

IF(DREMP-EPS)   95,8,8 
C  SORT  EIGENVALUES  AND  EIGENVECTORS 

95     DO   100   1=1. N 
IJ=INDEX(I,I) 
100     D(I)=R[IJ) 

DO  130  1=1, N 
DO  130  J=I'N 

IF(D(J)-D(i))  130,130,110 
110  ARR=D  I) 

D(I)=D(J) 
D(JJ=ARR 

IF(IND)  115,115,130 
115  DO  120  K=1.N 

IJ=INDEX(fc,I) 

IK=INDEX(K  J) 

ARK=V(IJ) 

V(IJ)=V(IK) 

V(IK)=ARK 
120  CONTINUE 
130  CONTINUE 
C         REPLACE  R  BY  TRANSPOSED  MATRIX  OF  EIGENVECTORS 

IF(IND)  135,135,150 
135  DO  140  1=1. N 

DO  140  J=1,N 

IJ=INDEX(J  I) 

IK=INDEX(I  J) 
140  R(IJ)=V(IK) 
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150  RETURN 
END 
C  SUBROUTINE  FQUAD 

C    PURPOSE:  FIND  THE  DISTRIBUTION  FUNCTION  F(X)=P(Q  .  LE.  X)  OF 
C  QUADRATIC  FORMS  IN  NORMAL  VARIABLES. 

C  DESCRIPTION  OF  PARAMETERS 

C  S     =  A  VECTOR  OF  LENGTH  NR  CONTAINING  THE  NON-ZERO 

C  EIGENVALUES 

C  NR      NUMBER  OF  NON-ZERO  EIGENVALUES 

C  FD      VALUE  OF  THE  DISTRIBUTION  FUNCTION 

C  EPS1  =  DESIRED  TRUNCATION  ERROR  TU 

C  EPS2  =  DESIRED  ACCURACY  OF  NUMERICAL  INTEGRATION 

C  XKRIT  =  VALUE  OF  X 

C 

C  METHOD:  THE  INTEGRAL  DERIVED  BY  IMHOF  IS  USED  AND  NUMERICALLY 

C  INTEGRATED  BY  SIMPSONrS  RULE 

C 

REFERENCE:  'COMPUTING  THE  DISTRIBUTION  OF  QUADRATIC  FORMS  IN 

C  NORMAL  VARIABLESr  BY  J.  P.  IMHOF 

C  BIOMETRIKA  (1961),  48,  3  AND  4,  P. 419 

c 

SUBROUTINE  FQUAD(S , NR , FD , EPS1 , EPS2 , XKRIT) 

DIMENSION  S(l) 

FD=0. 

UB=0. 

C2=0. 

C1=0. 

FBL=0. 

VINT2=0. 

FD1=0. 

SUMK=0. 

XK=0. 

SLAM=0. 

DO  6  1=1, NR 

S(I)=S(l5-XKRIT 

Yk=xK+  5 

6  SLAM=SLAM+.  5*ALOG(ABS(S(IV)) 

C         COMPUTE  UPPER-BOUND  OF  INTEGRAL  GIVEN  THE  TRUNCATION 
C         ERROR 

UB=EXP(-fALOG(EPSl*XK)  +  1.14472989  +  SLAM)/XK) 
C     PRINT  *,UB  ,   NX 

7  TU=EXP(1. 14472989  +  ALOG(XK)  +  SLAM  +  XK*ALOG(UB)) 
C     PRINT  *,TU 

VAL=1.>TU-EPS1 
C  PRINT  *,VAL 

GO  TO  20 


IF((1./TU-EPS1).  LT.O]  GO 
IFffTU-EPSn.EQ.  0)  GO  TO  1 
IFCC1.  /TU-EPS1).  GT.  0)  GO 


J.GT.  0)  GO  TO  9 
UB=UB""+  5./XK 
GO  TO  7 

COMPUTATION  OF  THE  INTEGRAND  GIVEN  THE  VALUE  OF  U 

10  IF(U)  15,15,11 

11  TETA=0. 

RHO=l. 

DO  14  1=1, NR 

C1=S(I)*U 

C2=l.  +  Cl**2 

TETA=TETA  +  ATAN(Cl) 

14  RHO=RHO*(C2**.25) 

TETA=. 5*(TETA) 
FBL=SIN(TETA)/(RHO*U) 
GO  TO  18 

15  FBL=0. 

DO  16  1=1, NR 

16  FBL=FBL+Sm 

FBL=. 5*(FBL-XKRIT) 
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18     GO  TO  (21.22.23.24.25).   KSKIP 
C  EVALUAYlCfN  CfF  fHE  TNTEGRAL  BY  SIMPSON'S  RULE 

20  RANGE=UB 

MH=1 

U=RANGE*. 5 
KSKIP=1 
GO  TO   10 

21  SUMK=FBL*RANGE*2.  /3. 

U=0. 

KSKIP=2 
GO  TO   10 

22  VINT2=SUMK+FBL*RANGE/6. 

U=UB 
KSKIP=3 
GO  TO  10 

23  VINT2=VINT2  +  FBL*RANGE/6. 

FD=. 5-.318309886*VINT2 

DO  28  NIT=1,14 

FD1=FD 

VINT2=(VINT2-SUMK*.5)*. 5 

MH=2*MH 

STEP=RANGE/MH 

U=STEP*.5 

KSKIP=4 

GO  TO  10 

24  SUMK=FBL 

IFCMH.  EQ.  2)  THEN 

U=U+STEP 

KSKIP=5 

GO  TO  10 

END  IF 

DO  25  K=2,MH 

U=U+STEP 

KSKIP=5 

GO  TO  10 

25  SUMK=SUMK+FBL 

SUMK=SUMK*STEP*2./3. 
VINT2=VINT2  +  SUMK 
FD=. 5  -  V318309886*VINT2 
IFCNIT-3)  28,28^27 

27  IF(ABS(FD1-FD)-EPS2)  29,28,28 

28  CONTINUE 

PAUSE  1234 

29  RETURN 

END 
C  SUBROUTINE  SCHMDT 

C 

C    PURPOSE:  GENERATE  THE  'A'  MATRIX  FOR  WHICH  EIGENVALUES  ARE 
C  COMPUTED 

C 

C    DESCRIPTION  OF  PARAMETERS 

C  A  =  MATRIX  FOR  WHICH  EIGENVALUES  ARE  COMPUTED 

C  N  =  ORDER  OF  MATRIX 

C 
C 

SUBROUTINE  SCHMDT(A,N) 

REAL  A(100,100) 

DO  10  1=1, N 

DO  20  J=1,N 

A(IvJ)=0. 
20  CONTINUE 
10  CONTINUE 

A  1,1)=2. 

A  1,2  =-1. 

ACll5)=-l. 

DO  30  1=2, N 

J=I 

IFfl.LE.  4)  THEN 

A(1,J)=3. 
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A(I,(J+1))=-1. 

A(I,(J+4))=-l. 

A(I,(J-1))=-1. 

END  IF 

IF((I,GE.  (N-3)).  AND.  (I.  LT.  N))  THEN 

A(I  ,J)=3. 

'J+l))=-l. 

J-l))=-l. 

•J-4))=-l. 

F 

EQ. N)  THEN 
AC  J)=?/ 

A 

END 

I 

A 


IF((I.GT.  4).  AND.  (I.LT.  (N-3)))  THEN 

A(I,J)=4. 

AC  I  (J-4j)=-l. 


Afl  fJ-l))=-l. 
A(I  (J+li)=-l. 
AfI,(J+4))=-l. 

END  IF 
30  CONTINUE 
RETURN 
END 
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C    THIS  PROGRAM  COMPUTES  THE  UPPER  BOUND  FOR  THE  SCHMIDT  STATISTIC 

C    USING  IMHOFF'S  INTEGRAL.   THE  FILE  MAT  IS  A  FILE  OF  EIGENVALUES 

C    OBTAINED  FROM  THE  SUBROUTINE  LATRV.  THE  USER  IS  REQUIRED  TO  SET 

C    THE  NUMBER  OF  OBSERVATIONS  AND  THE  NUMBER  OF  INDEPENDENT  VARIABLES. 

C    ADDITIONALLY,  A  STARTING  FOR  XKRITU  SHOULD  BE  SUPPLIED. 

C    THE  PROGRAM  WILL  PRINT  VALUES  FOR  THE  INTERGRAL  AND  XKRITU 

C    AS  THE  PROGRAM  EXECUTES.   IN  THIS  WAY  THE  USER  IS  ABLE  TO 

C    SEE  THE  HOW  WELL  THE  INTEGRAL  IS  CONVERGING.   IF  THE  INTEGRAL 

C    IS  CONVERGING  TOO  SLOWLY  THEN  THE  VALUE  OF  XKRITU  SHOULD  BE 

C    CHANGED 

REAL  Af 100. 100). R( 10000) ,V( 10000), D( 100), L( 100), U( 100), S(  100) 

REAL  EPSl.£PS2;£PS3.FD.XftRITU 

INTEGER  IND.N.lS.K.t.Nft.KP 

OPEN(UNIT=ld,FIL^=,MAT1} 

OPENf UNIT=12,FILE='0UT' ) 

XKRITU=0. 

XKRITU=3.80 
C  ASSIGN  VALUE  TO  ORDER  OF  MATRIX 

N=20 
C      SET  NUMBER  OF  INDEPENDENT  VARIABLES  (INCLUDE  INTERCEPT  TERM 

K=5 

T  Q M  *  M 

C       "  SET  DESIRED  TRUNCATION  ERROR 

EPS1=. 0001 
C  SET  DESIRED  ACCURACY  OF  NUMERICAL  INTEGRATION 

EPS2=. 0001 
C  SET  DESIRED  ACCURACY  OF  DISTRIBUTION  FUNCTION 

EPS3=. 001 
C  SET  VALUE  OF  DISTRIBUTION  FUNCTION 

C     FD=. 05 
C  SET  INPUT  CODE 

IND=1 
C      ADJUST  FOR  THE  NUMBER  OF  COLUMNS  OF  THE  MATRICES  'A'  AND 
C         TX'  WHICH  ARE  LINEAR  COMBINATIONS  OF  EACH  OTHER 

T=N-K 
C  GENERATE  THE  'A'  MATRIX 

CALL  SCHMDT(A,N) 

WRITEC10  *)((A(I,J),  J=1,N),  1=1, N) 

REWIND  10 
C  READ  IN  'A'  MATRIX  FOR  WHICH  EIGENVALUE  AND  EIGENVECTORS 

C  WILL  BE  COMPUTED 

READ  C 10  *)   (R(I).  1=1, IS) 
C  COMMUTE  THE  EIGENVALUES  AND  EIGENVECTORS  OF  MATRIX  'A' 

CALL  LATRV(R,N,V,D,IND) 

WRITE(12,*)(D(1)  1=1, N) 
40  REWIND  12 

READ(12  *HD(I).  1=1. N) 
C  SET  THE  NUMBER  dF  NON-ZERO  EIGENVALUES 

NR=T 
C  SORT  THE  N-K  LARGEST  EIGENVALUES 

DO  20  1=1, T 

U(I)=D(_I) 
20  CONTINUE 
C         COMPUTE  LOWER  BOUND  FOR  STATISTIC 

CALL  FQUADTU.NR,FD,EPS1.EPS2. XKRITU) 
C        SUCCESSIVELY  HALF  XKRITU  To  FIND  THE  5  PERCENT 
C  SIGNIFICANCE  POINT 

XKRITD=ABS(XKRITU-XKRITO) 

XKRITO=XKRITU 

IF((ABS(FD-.05)-EPS3).  LE.  0)  GO  TO  30 

IF((FD-.05).LT.O)  THEN 

XKRlTU=XKRlTU+.  5*XKRITD 

PRINT  *  FD, XKRITU 

GO  TO  40 

END  IF 

IF((FD-.05).GT.  0)  THEN 

XKRlTU=XKRlTU-.5*XKRITD 

PRINT  *AFD, XKRITU 

GO  TO  46 
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END  IF 
30  PRINT  *,FD,XKRITU 

STOP 

END 
C  SUBROUTINE  LATRV 

C 

C      PURPOSE:  FIND  THE  EIGENVALUES  AND  VECTORS  OF  A  REAL  SYMMETRIC 
C      MATRIX 
C 

C      DESCRIPTION  OF  VARIABLES: 

C         R=  A  REAL  SYMMETRIC  MATRIX,  DESTROYED  DURING  COMPUTATIONS 
C  AND  REPLACED  BY  THE  TRANSPOSED  MATRIX  OF  EIGENVALUES 

C         N=  ORDER  OF  THE  MATRICES 
C         V=  MATRIX  OF  EIGENVECTORS 
C         D=  VECTOR  OF  EIGENVALUES  IN  DESCENDING  ORDER 
C       IND=  INPUT  CODE 

IF  IND=0  EIGENVALUES  AND  EIGENVECTORS  ARE 
C  COMPUTED 

C  IF  IND=1  EIGENVALUES  ARE  COMPUTED  ONLY 

C 

C      METHOD:  DIAGONALIZATION  METHOD  OF  JACOBI 
C 

C       REFERENCE:  NATIONAL  PHYSICAL  LABORATORIES,  'MODERN  COMPUTING 
C  METHODS1^ 

C 

c 

SUBROUTINE  LATRV  (R,N,V,D,IND) 
DIMENSION  R(l),V(l),D(l) 

C         GENERATE  IDENTITY  MATRIX 
IF(IND)  1,1,4 

1  NN=N*N 

IJ=N+1 

DO  2   1=1, NN 

2  V(I)=0 

DO  3   I=1,NN,IJ 

3  V(I)=1 

C  COMPUTE  FINAL  NORM  EPS 

4  EPS=0 

DO  6   1=1, N 
DO  6  J=1,N 
IJ=INDEX£J.I) 
IF(I-J)   5.6,5 

5  EPS=EPS+R(1J)**2 

6  CONTINUE 

IF(EPS-.  0000001)   95.95.7 

7  EPS=SQRT(EPS)*.0000001/N 

DREMP=1 

8  L2=0 

DO  80   LR=1,N 
IR=INDEX(0,LR) 

DO  80   LK=LR\N 
IF(LK-LR)  80,80,10 
10     IK=INDEX(0,LK) 
K1=LR+IK 
K2=LR+IR 
K3=LK+IK 

AR^RfKl) 

IFfABS(ARK)-DREMP)  80,80,20 
20  L2=l 
C  COMPUTE  SIN  AND  COS 

ARR=R(K2) 

AKK=R(K3) 

XL=2*ARK 

XM=ARR-AKK 

SQ=SQRT(XL*XL+XM*XM) 

SIN2T=xt/SQ 

IF(XM)  25,26,26 
25  SIN2T=-SIN2T 
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26     COST= 
S 


ST=SQRTjfri+ABSCXM^)/SQ)/2) 
INT=SIN2T/(2*C0ST) 

SINT2=SINT**2 
C0ST2=C0ST**2 

MODIFY  MATRICES 
DO  40  J=1,N 
K4=J+IR 
K5=J+IK 
K6=INDEXl 
K7=INDEX| 
ARJ=R(K4 
AKJ=R(K5; 

R(K4)=ARJ*C0ST+AKJ*SINT 
R(K5)=AKJ*C0ST-ARJ*SINT 
R(K6)=R(K4) 

R(K7)=R(K5) 
40  CONTINUE 

K7=LK+IR 

R(K1)=0 

R(K7)=0 

R(K2)=ARR*C0ST2+AKK*SINT2+ARK*SIN2T 

R(K3)=ARR*SINT2+AKK*C0ST2-ARK*SIN2T 

IF(IND)  45,45,80 
45  DO  50  1=1, N 

K1=I+IR 

K2=I+IK 

ARR=V(K1) 

AKK=V(K2) 

V(K1)=C0ST*ARR+SINT*AKK 
50  V(K2)=C0ST*AKK-SINT*ARR 
80  CONTINUE 

IF(L21  8,90,8 
90  DREMP=DREMP*. 1 

COMPARE  THRESHOLD  DREMP  WITH  NORM  EPS 

IF(DREMP-EPS)  95,8,8 

SORT  EIGENVALUES  AND  EIGENVECTORS 
95  DO  100  1=1. N 

IJ=INDEX(f,I) 
100  D(I)=R(IJ) 

DO  130  1=1, N 

DO  130  J=I,N 

IF(D(J)-D(I))  130,130,110 
110  ARR=D(I) 

D(I)=D[J) 

D(J)=ARR 


IF(lND)  115,115,130 
115  DO  120  K=1,N 

IJ=INDEX(K,I) 

IK=INDEX(K  J) 

ARK=V(IJ] 

V(IJ)=V{IK) 

V(IK)=ARK 
120  CONTINUE 
130  CONTINUE 
C        REPLACE  R  BY  TRANSPOSED  MATRIX  OF  EIGENVECTORS 

IF(IND)  135,135,150 
135  DO  140  1=1. N 

DO  140  J=i,N 


IJ=INDEX(J,I) 


IK=INDEX( 
140  R(IJ}=V(IK 
150  RETURN 
END 
C  SUBROUTINE  FQUAD 

C    PURPOSE:  FIND  THE  DISTRIBUTION  FUNCTION  FfX)=P(Q  .  LE.  X)  OF 
C  QUADRATIC  FORMS  IN  NORMAL  VARIABLES. 

C    DESCRIPTION  OF  PARAMETERS 
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C  S    =  A  VECTOR  OF  LENGTH  NR  CONTAINING  THE  NON-ZERO 

C  EIGENVALUES 

C  NR     NUMBER  OF  NON-ZERO  EIGENVALUES 

C  FD     VALUE  OF  THE  DISTRIBUTION  FUNCTION 

C  EPS1  =  DESIRED  TRUNCATION  ERROR  TU 

C  EPS2  =  DESIRED  ACCURACY  OF  NUMERICAL  INTEGRATION 

C  XKRIT  =  VALUE  OF  X 

C 

C    METHOD:  THE  INTEGRAL  DERIVED  BY  IMHOF  IS  USED  AND  NUMERICALLY 

C  INTEGRATED  BY  SIMPSON'S  RULE 

C 

C    REFERENCE:  'COMPUTING  THE  DISTRIBUTION  OF  QUADRATIC  FORMS  IN 

C  NORMAL  VARIABLESr  BY  J. P.  IMHOF 

C  BIOMETRIKA  (1961),  48,  3  AND  4,  P. 419 

u 

c 

SUBROUTINE  FQUAD(S,NR,FD,EPS1 ,EPS2, XKRIT) 

DIMENSION  S(l) 

FD=0. 

UB=0. 

C2=0. 

C1=0. 

FBL=0. 

VINT2=0. 

FD1=0. 

SUMK=0. 

XK=0. 

SLAM=0. 

DO  6  1=1, NR 

S(I)=S(I)-XKRIT 

6  SLAM=SLAM+.  5*ALOG(ABS(S(IV)) 

C        COMPUTE  UPPER-BOUND  OF  INTEGRAL  GIVEN  THE  TRUNCATION 
C         ERROR 

UB=EXP(-(ALOG(EPSl*XK)  +  1.14472989  +  SLAM)/XK) 
C     PRINT  *,UB  ,  vs 

7  TU=EXP(1. 14472989  +  ALOG(XK)  +  SU\M  +  XK*ALOG(UB)) 
C     PRINT  *,TU 

VAL=1.>TU-EPS1 
C     PRINT  *,VAL 

IF((l.yTU-EPSl).  LT.  0]  GO  TO  20 
IF((TU-EPS1].EQ.0J  GO  TO  9 
IF(C1.  /TU-EPS1).  GT.  0)  GO  TO  9 

9  UB=UB  +  5./XK 

GO  TO  7 
C        COMPUTATION  OF  THE  INTEGRAND  GIVEN  THE  VALUE  OF  U 

10  IFOI)  15,15,11 

11  TETA=0. 

RH0=1. 

DO  14  1=1, NR 

C1=S(I)*U 

C2=l:  +  Cl**2 

TETA=TETA  +  ATAN(Cl) 

14  RHO=RHO*(C2**.25) 

TETA=.  5*(JETA) 
FBL=SIN[TETA)/(RHO*U) 
GO  TO  18 

15  FBL=0. 

DO  16  1=1, NR 

16  FBL=FBL+Sm 

FBL=. 5*(FBL-XKRIT) 
18     GO  TO  (21.22.23,24:25).    KSKIP 
C  EVALUXTKJN  tfF  fHE  TNTEGRAL  BY  SIMPSON'S  RULE 

20  RANGE=UB 

MH=1 

U=RANGE*. 5 
KSKIP=1 
GO  TO   10 

21  SUMK=FBL*RANGE*2.  /3. 
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u=o. 

KSKIP=2 

GO  TO  10 

22  VINT2=SUMK+FBL*RANGE/6. 

U=UB 
KSKIP=3 
GO  TO  10 

23  VINT2=VINT2  +  FBL*RANGE/6. 

FD=. 5-. 318309886*VINT2 

DO  28  NIT=1,14 

FD1=FD 

VINT2=(VINT2-SUMK\5)\5 

MH=2*MH 

STEP=RANGE/MH 

U=STEP*. 5 

KSKIP=4 

GO  TO  10 

24  SUMK=FBL 

IFfMH.  EQ.  2)  THEN 

U=U+STEP 

KSKIP=5 

GO  TO  10 

END  IF 

DO  25  K=2,MH 

U=U+STEP 

KSKIP=5 

GO  TO  10 

25  SUMK=SUMK+FBL 

SUMK=SUMK*STEP*2./3. 
VINT2=VINT2  +  SUMK 
FD=. 5  -  ,318309886*VINT2 
IF(NIT-3i  28,28^27 

27  IF(ABS(FD1-FD)-EPS2)  29,28,28 

28  CONTINUE 

PAUSE  1234 

29  RETURN 

END 
C  SUBROUTINE  SCHMDT 

C 

C    PURPOSE:  GENERATE  THE  'A1  MATRIX  FOR  WHICH  EIGENVALUES  ARE 
C  COMPUTED 

C 

C    DESCRIPTION  OF  PARAMETERS 

C  A  =  MATRIX  FOR  WHICH  EIGENVALUES  ARE  COMPUTED 

C  N  =  ORDER  OF  MATRIX 

C 
C 

SUBROUTINE  SCHMDT(A,N) 

REAL  A(100,100) 

DO  10  1=1, N 

DO  20  J=l  N 

A(I  J)=0. 


)0  20  v. 
DNTlNUl 


20  CONTINUE 
10  CONTINUE 

A(l,l)=2. 

All  2)=-l. 

AC1  5J=-1. 


DO  3*0  1=2',  N 

IFfl.LE.  4)  THEN 
A(1,J)=3. 
A(I,(0+1))=-1. 
Afl  (J+4}1=-1. 

Afi  h-i))=-i. 

END  IF 

IFf  (I.GE.  (N-3)).  AND.  (I.  LT.  N))  THEN 

A(I  ,J)=3. 

A  I  fj+lj)=-l. 

A(I,(J-1))=-1. 
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IF(I.EQ.N)  THEN 

A(I  (j-l))=-l. 
Afl  (J-4))=-l. 

END  IF 

IF((I,GT.  4).  AND.  (I.  LT.  (N-3)))  THEN 
A(I,J)=4. 
ACI  (J-4))=-l. 
A(I,(J-1))=-1. 
Afl  (J+l)1=-l. 
A(I,[J+4))=-l. 
END  IF 
30  CONTINUE 
RETURN 
END 
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Program  Stats  is  an  APL  simulation  which  was  used  to  generate  data  for  the 
comparison  of  the  estimates  of  p^  and  the  ability  of  the  Schmidt  statistic  as  part  of  the 
AR(1,4)  procedure  and  the  Durbin- Watson  statistic  as  part  of  the  two-step  procedure 
to  detect  the  AR(1,4)  disturbance. 

V  STATS ;  AD1 ;  AD ;  AW1B ;  AW1 ;  AW2  •,  AW3  ;  AWVB ;  AW**  ;  AW5  ;  AW ;  AS ;  PW 

[I]  p   ENTER  VALUES  FOR  RHOl  AND  RHO* 
[  2  ]  •  ENTER  A  VALUE  FOR  RHO 1 ' 

[3]  RH01+U 

[  4  ]  » ENTER  A  VALUE  FOR  RHOn  ' 

[5]  RHOn+U 

[  6  ]  i  ENTER  NUMBER  OF  OBSERVATIONS  ' 

[7]  NOBS+U 

[  8  ]  '  ENTER  VARIANCE  OF  NORMAL  DISTRIBUTION  • 

[9]  SIG2+Q 

[10]  o  INITIALIZE  BOUND  FOR  STATISTICAL  TESTS 

[II]  DWL+1.65 
[12]  DWU+1.69 
[13]  DWAL+1.59 
[14]  DWAU+1.63 
[15]  DSMTL+3.H5 
[16]  DSMTU+3. 53 

[  1 7  ]  q  INITIALIZE  OUTER  LOOP  VECTORS  AND  COUNTERS 

[18]  RHOAVl+^OpO 

[19]  RHOAVnS+nOpO 

[20]  RHOAVV+nQpO 

[21]  i?l«-40p0 

[22]    i?4S«-40p0 

[23]    /?4«-40p0 

[24]  DWCNT+0 

[2  5]  DSTCNT+0 

[26]  DWACNT+0 

[27]  DWINCT+0 

[28]  STINCT+0 

[29]  DWAICT+0 

[3  0]  fl  GENERATE  DURBIN -WATSON   » A'  MATRIX 
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[31]   NO+NOBS 

[32]   4Z)lB<-(((iV0-2)pO),~l,2,~l)),((A/0-2)pO),_l,l 

[3  3]  ADl+i , "1 , ( ( (NO* (NO-2 ) )+(N0-2 ) )pADlB 

[34]  AD+(NO,NO)pADl 

[3  5]  p   GENERATE  THE  WALLIS  'A'  MATRIX 

[36]    Atflfl«-(((tf0-5)-3)pO) 

[3  7]    AtflO((((iV0-5)  +  l)pO),l,O,O,O,~l)),AtflS) 

[38]     fltfl«-l,O,O,O,~l,((((3xJV0)+3)pAJ/lC 

[3  9]    Atf2*(((JV0-8)xtf0)-9)p(((JV0-8)pO),~ltO,O,O,2,OfO,O,'l) 

[40]    4#3<-~l,0,0,0,2,0,0,0,~l,4tf2 

[41]    A*mfl«-((<3xiV0)  +  3))p((((JV0-5)  +  l)pO),  '1,0,  0,0,1) 

[42]    4tf4<-  ((  (JV0-  5  )  -3  )p0),~  1,0,  0,0,1,^^45 

[43]    JUy5<-Atfl,Jltf3,AtfU 

[44]   AW+(NO,NO)pAW5 

[  4  5  ]  p  GENERATE  THE  SCHMIDT  •  A  "  MATRIX 

[46]  A5<-AZ?+AV 

[  4  7  ]  p  0£/TE5  L00P  F0P  REPLICATION 

[4  8]  COUNTO+0 

[4  9]  OUTER iCOUNTO+COUNTO  +  1 

[  5  0  ]  p  INITIALIZE  INNER  LOOP  VECTORS 

[51]  RHOHATl+lOpO 

[52]  RHOHATHS+lOpO 

[53]  /?#0ffAr4«-lOpO 

[  54]  p  IMVE7?  L00P  T0  CALCULATE  RHOl ,  Pff04  4tf£>  DETERMINE  DETECTION 

[5  5]  COUNTI+0 

[56]  IAWEP :  COUNTI+COUNTI+1 

[57]  DETDW+0 

[58]  DETSMT+0 

[59]  DETWA+0 

[60]  INSTCT+0 

[61]  INDWCT+0 

[62]  INDWAT+0 

[63]  r  GENERATE  THE  RANDOM  ERROR 

[64]  V+NOBS  NORRANDiO  ,SIG2) 

[6  5]  E+NOBSpO 
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fl  CALCULATE  CORRECTION  TERMS 
J<-(l-(RHOn*2))*0.5 
I+-(RH01*J) 
G+RHOnxI 

F<r(l+(RH01*2)-(RHOH*2)-(I*2))*0.5 
E1+-(RH01*F) 
D+-(I*G)*F 

C+(l+(RH01*2)-(RHOn*2)-(El*2))*0.5 
B1+- (RH01- (D*E1) )*C 

A<r(l-(RHOV*2)-(G*2)-(D*2)-(B1*2))*0.5 
p       CALCULATE  THE  FIRST  FOUR  ERROR  TERMS 
EZ11+V111*A 

EZ2l  +  (VZ2l-(BlxEll]  ))*C 
E[3]«-(y[3]-(0xE[l]  )-(Elx£[2]  ))*F 
E[4]«-(7[4]-(GxE[l]  )-(IxE[3]  ))*«/ 
MM:W«-W+1 
EM<-(i?tf01xE[JV-l]  )+(flff04x£[W-4]  )+7[JV] 

E+(NOBS,l)pE 

a  GENERATE  THE  INDEPENDENT  VARIABLES 

Xl<r  (NOBS ,  1 )  p  (NOBSp  1 ) 

X2+(N0BS  ,l)p  (NOBS  NORRAND  0  0.062  5) 

X«-X1,X2 

fl  GENERATE  THE  TRUE  BETA 

BETAT+  2  1  p  1  1 

fl  generate  the  dependent  variables 

y+(x+.*betat)+e 

fl  least  squares  estimate  of  beta 

b+y®x 

fl  GENERATE  THE  RESIDUALS 

EHAT+Y-(X+.xB) 

fl  CALCULATE  THE  DURBIN-WATSON  STATISTIC 

DW<r ,  (  ( §EHAT )+.x(AD+.  *EHAT )  ) *  (  ( §EHAT )  + .  *EHAT ) 
[100]  fl  CALCULATE  THE  SCHMIDT  STATISTIC 
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[66] 

[67] 

[68] 

[69] 

[70] 

[71] 

[72] 

[73] 

[74] 

[75] 

[76] 

[77] 

[78] 

[79] 

[80] 

[81] 

[82] 

[83] 

[84] 

[85] 

[86] 

[87] 

[88] 

[89] 

[90] 

[91] 

[92] 

[93] 

[94] 

[9  5] 

[96] 

[97] 

[98] 

[99] 

[101]  DS<- ,  (  ( §EHAT  )  +  .  x  (AS+ .  xEHAT  )  )  *  ( ( §EHAT )  +  .  xEHAT  ) 

[102]  p  COMPARE  TABLED  VALUE  FOR  DURBIN-WATSON  STATISTIC  TO 

[10  3]  r  THE  ABOVE  CALCULATED  VALUE  FOR  THE  DURBIN-WATSON 

[104]  DETDW+iDWZDWL) 

[10  5]  DWCNT+DWCNT+DETDW 

[106]  INDWCT+((DWL<DW)a(DWU>DW)) 

[107]  DWINCT+DWINCT+INDWCT 

[10  8]  p  COMPARE  TABLED  VALUE  FOR  SCHMIDT  STATISTIC 

[109]  p  TO  ABOVE  CALCULATED  VALUE  FOR  SCHMIDT  STATISTIC 

[110]  DETSMT+iDSZDSMTL) 

[111]  DSTCNT+DSTCNT+DETSMT 

[112]  INSTCT+((DSMTL<DS)*(DSMTU>DS)) 

[113]  STINCT+STINCT+INSTCT 

[114]  p  CALCULATE  THE  DURBIN-WATSON  ESTIMATE  FOR  RHOl 

[115]  RHOHAT1LCOUNTI1+1- (DWi2) 

[116]  R  CALCULATE  THE  SCHMIDT  ESTIMATE  FOR  RHOn 

[117]  RHUS1+2-RHOHAT1LCOUNTI1 

[118]  RHnS2+DS*2 

[119]  RHOHATHSICOUNTH +RHVS1 -RHUS2 

[12  0]  r  GENERATE  TRANSFORM  MATRIX  FOR  FIRST  ORDER  PROCESS 

[121]  PIR,  (l-(J?ff0ffATl[C0£WZT]*2))*O.5 

[122]  PAR((W0xW0)-l)p(((MRl)pO), (-RHOHAT11COUNTI1 ),1) 

[123]  PDW+(NO,NO)pPD,PW 

[124]  a  TRANSFORM  RESIDUALS 

[12  5]  ESTAR+PDW+.xEHAT 

[126]  p  CALCULATE  THE  WALLIS  STATISTIC 

[127]  DM*,  ((^srA/?)+.x(^^+.x£'5r^i?))t((is}E5r^/?)+.x£'sr^/?) 

[12  8]  p  COMPARE  TABLED  VALUE  OF  WALLIS  STATISTIC 

[12  9]  fl  T0  CALCULATED  VALUE  OF  WALLIS  STATISTIC 

[130]  DETWA+(DWA<>DWAL) 

[131]  DWACNT+DWACNT+DETWA 

[13  2]  INDWAT+ ( (DWAL<DWA ) a (DWAU>DWA ) ) 

[133]  DWAICT+DWAICT+INDWAT 

[134]  p  CALCULATE  THE  WALLIS  ESTIMATE  FOR  RHOn 

[13  5]  i?tf0tfAr4[C0tW2T]«-l-(IW;U2) 
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[136]   -»» (  COUNT K  5  )  /INNER 
[13  7]   RH0AVllC0UNT01+(+/RH0HATl)*5 
[13  8]   RHOA  V^S  [  CO  UNTO  3  «-  (  +  /RHOHAT^S  )  *  5 
[13  9]   flff  04  74  [  CO  UN  TO  ]  «•  ( +  /RHOHATn  )  *  5 
[140]    +(COUNTO<n0) /OUTER 
[141]    CNTDW+DWCNT 
[142]    CNTDST+DSTCNT 
[143]    CNTDWA+DWACNT 
[144]   J?l«-J?tf0Ayi 
[14  5]   RUS+RHOAVUS 
[146]    /?4«-/?ff0A74 
V 
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Program  Regress  is  an  APL  simulation  which  generates  data  for  the  comparison 
of  the  regression  parameters  from  both  the  two-step  and  AR(1,4)  procedures. 

V  REGRESS ;  AD  ;AD1  ;AN1B  ;AW1  ;AW2  ;AW3  ;AWVB  -,AWH  ;AW5  ;  AW ;  AS 
[  1  ]  p   ENTER  VALUES  FOR  RHO 1  AND  RHOn 
[  2  ]  '  ENTER  A  VALUE  FOR  RHO  1  ■ 
[3]  RHOl+U 

[4]  '  ENTER  A  VALUE  FOR  RHOn  ' 
[5]  RHOn+U 

C 6  ]  '  ENTER  NUMBER  OF  OBSERVATIONS ' 
[7]  NOBS+U 

C  8  ]  ■  ENTER  VARIANCE  OF  NORMAL  DISTRIBUTION  ■ 
[9]  SIG2+U 

DSMTL+3.*i5 

DSMTU+3.5  3 

p  GENERATE  DURBIN-WATSON  »A"  MATRIX 

NO+NOBS 

ADlB+(t((NO-2)pO) ,~1 ,2 ,~D) ,  (<M?-2)pO) ,"l ,1 

A01«-l,~l,  (((W0x(;V0-2))  +  (iV0-2))pAr;iB 

ilZ?-«-(#0,JV0)p£01 

p  GENERATE  THE  WALLIS   ■  4  '  MATRIX 

AWlB+(((NO-5)-3)pO) 

AWlC+((((NO-5)+l)pO) ,1 ,0 ,0 ,0 ,~1)) ,AW1B) 

Atfl<-l,0,0,0,~l,  ((((3xA?<9)  +  3  )p£tflC 

A#2«-<  ((#0-8)x#0)-9)p(((ff0-8)pO), "1,0,0 ,0,2,0, 0,0, "l) 

AAr3*"l,0,0,0,2,0,0,0,"l,ilW2 

Atf4B<-(((3xJV0)+3))p((((JV0-5)  +  l)pO),~l,O,O,O,l) 

£#»M-(((ff0-5)-3)pO),~l,O,O,O,l,Jltf4B 

JUr5-*-JUri  ,473  .AW 

Atf<-(W0,JV0)piW/5 

p  GENERATE  THE  SCHMIDT  •  A  '  MATRIX 

AS+AD+AW 

p  GENERATE  TRANSFORM  MATRIX  FOR  FIRST  ORDER  PROCESS 

NO+NOBS 

PD+,  (l-(J?ff01*2))*O.5 

Ptf<-  ( ( WO  x M? )  - 1 )  p  ( (  ( NO  - 1  )  p  0  )  ,  (  -RHO  1  )  ,  1 ) 
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[10] 

[11] 

[12] 

[13] 

[14] 

[15] 

[16] 

[17] 

[18] 

[19] 

[20] 

[21] 

[22] 

[23] 

[24] 

[25] 

[26] 

[27] 

[28] 

[29] 

[30] 

[31] 

[32] 

C33] 

[34] 

[35] 

[36] 

[37] 

[38] 

[39] 

[40] 

[41] 

[42] 

[43] 

[44] 

[45] 

[46] 

[47] 

[48] 

[49] 

[50] 

[51] 

[52] 

[53] 

[54] 

[55] 

[56] 

[57] 

[58] 

[59] 

[60] 

[61] 

[62] 

[63] 

[64] 

[65] 

[66] 

[67] 

PDW+(NO,NO)pPD,PW 

p  GENERATE  TRANSFORM  FOR  FOURTH  ORDER  PROCESS 

ri«-(l-(i?tf04*2))*O.5 

P41«-n,  ((M?-l)p0) 

P42«-O,n,((W0-2)pO) 

P43«-0,0,n,  ((M9-3)p0) 

P44<-0,0,0,n,  ((M9-4)p0) 

P45*((W0-4)xW0)p((-Pff04),O,O,O,l,((M9-4)pO)) 

Pn+(NO,NO)p (P41 ,P42 ,P43 ,P44,P45 ) 
R  INITIALIZE  MATRICES  AND  VECTORS  FOR  OUTER  LOOP 
BLSIAV+VOpO 
BLSSAV+^OpO 
B2SIAV+n0p0 
B2SSAV+U0p0 
BmiA  7-6-40  pO 
B  mSA  7-«-40  pO 
DSBCNT+0 
DSACNT+0 
STIBCT+0 
STANCT+0 

p  OUTER  LOOP  FOR  REPLICATION 
COUNT 0+0 

OUTER : COUNTO+COUNTO+1 

p  INITIALIZE  MATRICES  AND  VECTORS  FOR  INNER  LOOP 
BLSI+5p0 
BLSS+5pO 
B2SI+5pO 
B2SS+5pO 
S14i>5p0 
S14S«-5pO 

R  INNER  LOOP  TO  CALCULATE  RHOl ,  RHOn  AND  DETERMINE  DETECTION 
COUNTI+0 

INNER : COUNTI+COUNTI+1 
p  GENERATE  THE  RANDOM  ERROR 
V+NOBS  NORRAND  (  0  ,  SIG2  ) 

70 


[6  8]  E+NOBSpO 

[69]  N+U 

[70]  R  CALCULATE  CORRECTION  TERMS 

[71]  «7<-(l-(/?ff04*2))*O.5 

[72]  I+-(RH01*J) 

[73]  G+RHOnxI 

[74]  F«-(l  +  (P#01*2)-(Pff04*2)-(I*2))*O.5 

[75]  F1<--(FJ701*F) 

[76]  Z?«c-(IxG)*F 

[77]  O(l  +  (Ptf01*2)-(P#04*2)-(Fl*2))*O.5 

[78]  Bl«~(f?fl01-(JDxEl))*C 

[7  9]  d«-(l-(FJ704*2)-(G*2)-(£*2)-(£l*2))*O.5 

[  8  0  ]  p  DETERMINE  TRANSFORM  MATRIX  FOR  1 , 4  PROCESS 

[81]  P14K-A,  ((JVO-l)pO) 

[82]  P142«-BlfC,(<2V0-2)pO) 

[83]  P143«-Z?,F1,F,  ((M9-3)p0) 

[84]  P144<-G,0,I,c7,((WO-4)pO) 

[8  5]  P14  5^-((iVO-4)x^O)p((-i?^C4),0,0,(-i?ffCl),l,((^<9-4)p0)) 

[86]  P14«-(W0,iV0)p(P141,P142,P143,P144,P14  5) 

[87]  a  CALCULATE  THE  FIRST  FOUR  ERROR  TERMS 

[88]  F[l]«-7[1]*4 

[89]  F[2]<-(7[2]-(BlxF[l]  ))*C 

[90]  F[3]«-(7[3]-(Dx£[l]  )-(FlxF[2]  ))*F 

[91]  F[4]«-(7[4]-(Gx£[l]  )-(JxF[3]  ))W 

[92]  MMlN+N+1 

[93]    F[W]<-(P#aixE[tf-l]  )  +  (Ptf04xF,[/y-4]  )+7[W] 

[94]    +(N<N0BS)/MM 

[95]    E+(N0BS,l)pE 

[96]  p  GENERATE  THE  INDEPENDENT  VARIABLES 

[97]  Xl*-(lV0BS,l)p(iV0flSpl) 

[98]  X2+(N0BS  ,l)p  (NOBS  NORRAND  0  0.0625) 

[99]  X«-X1,X2 

[100]  r  GENERATE  THE  TRUE  BETA 

[101]  BETAT+  2  1  p  1  1 

[102]  a  GENERATE  THE  DEPENDENT  VARIABLES 

71 


[103]  Y+(X+.*BETAT)+E 

[104]  p  LEAST  SQUARES  ESTIMATE  OF  BETA 

[10  5]  BLS+Y&X 

[106]  BLSI ICO UNTI1+BLS Tl ; ] 

[107]  BLSSLCOUNTIl+BLSHil 

[10  8]  a  GENERATE  THE  RESIDUALS 

[109]  EHAT+Y-(X+.xBLS) 

[110]  r  £/S£  FIRST  ORDER  P  MATRIX  TO  TRANSFORM  Y ,  X ,  AND  EH  AT 

[111]  a  TRANSFORM Y 

[112]  JSTAPH-PZW+.xY 

[113]  p  TRANSFORM  X 

[114]  XSTARl+PDW+.xX 

[115]  ft  TRANSFORM  RESIDUALS 

[116]  ESTARl+PDW+.xEHAT 

[117]  R  USE  FOURTH  ORDER  P  MATRIX  TO  TRANSFORM  YSTAR1 ,  XSTAR1 

[118]  r  ANDESTAR1 

[119]  fl  TRANSFORM Y 

[120]  YSTAP4<-P4+.xYSrdPl 

[121]  R  TRANSFORM  X 

[122]  xsrAfl4<-P4+.xxsrAi?i 

[123]  fl  TRANSFORM  RESIDUALS 

[124]    £SrAP4«-P4  +  .x£;S2yiPl 

[12  5]  fl  CALCULATE  VALUE  FOR  2  -STEP  BETA 

[126]    B2S«-YSr;iP4!XS2'4P44 

[127]  B2SI[C0CWri]«-B2S[l;] 

[128]  S2SS[C0CW2T]«-P2S[2;] 

[129]  fl  CALCULATE  THE  SCHMIDT  STATISTIC  BEFORE 

[13  0]  fl  TRANSFORMATION 

[131]  DSlC+((§EHAT)+.xEHAT) 

[132]  Z?5^,((^MD  +  .x(A5+.xE^r))*Z?SlC 

[13  3]  fl  COMPARE  TABLED  VALUE  FOR  SCHMIDT  STATISTIC 

[134]  a  TO  ABOVE  CALCULATED  VALUE  FOR  SCHMIDT  STATISTIC 

[13  5]  DEBSMT+iDSSDSMTL) 

[13  6]  fl  IF  DETECTED  INCREMENT  1 

[13  7]  DSBCNT+DSBCNT+DEBSMT 
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[13  8]  INSBCT+ ( (DSMTKDS ) a (DSMTU>DS ) ) 

[13  9]  p  IF  INCONCLUSIVE  INCREMENT  1 

[140]  STIBCT+STIBCT+INSBCT 

[141]  p  USE  1 ,  4  P  MATRIX  TO  TRANSFORM  Y,X,  AND  EH  AT 

[142]  p  TRANSFORM! 

[143]  YSTARm+pm+.xY 

[144]  p  TRANSFORM  X 

[145]  XSTARlK+PlK+.xX 

[146]  r  CALCULATE  VALUE  FOR  1,4  BETA 

[147]  S14^-Y5r^i?14iX52'^i?14 

[148]    B14J[C0£/MT]<-514[1;] 

[149]    BmSLCOUNTIl+Bmi2;l 

[150]  r  CALCULATE  RESIDUALS 

[151]    ES2VLi?14«-YSr4i?14-(XSrA/?14+.xB14) 

[15  2]  r  CALCULATE  THE  SCHMIDT  STATISTIC  AFTER 

[15  3]  r  THE  TRANSFORMATION 

[154]    DS2Z?^-((^E527^i?14)+.xES2'A/?14) 

[155]  DS+ ,  (  (§ESTARm)+  .*  (AS+ .  xESTARm)  )*DS2D 

[156]  R  COMPARE  TABLED  VALUE  FOR  SCHMIDT  STATISTIC 

[15  7]  a  TO  ABOVE  CALCULATED  VALUE  FOR  SCHMIDT  STATISTIC 

[158]  DETSMA+(DS<DSMTL) 

[15  9]  a  IF  DETECTED  INCREMENT  1 

[16  0]  DSACNT+DSACNT+DETSMA 

[161]  INSACT<t((DSMTL<DS)a(DSMTU>DS)) 

[16  2]  pIF  INCONCLUSIVE  INCREMENT  1 

[16  3]  STANCT+STANCT+INSACT 

[164]  ->  ( COUNT K  5  )  /INNER 

[165]  p  CALCULATE  AVERAGE  VALUES  FOR  BETAS 

[166]  SLSJA7[C0C/^rC]<-(+/5L5I)*5 

[167]  BLSSAf [C0£Wr0]«-(+/BLSS)*5 

[168]  B2SJ4f[C0£Wr0X+/£2SI)*5 

[169]  £2SS;il7[C0CWr0]<-(+/B2SS)*5 

[170]  B14JA7[CCC/iVrO]<-(+/B14J)*5 

[171]  £i4s;mc0:wz'0]«-(+/si4S)*5 

[17  2]  +(COUNTO<*0) /OUTER 
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[173]  AF+DSACNT 
[174]  BE+DSBCNT 
[17  5]  LSI+BLSIAV 
[176]  LSS+BLSSAV 
[177]  I2S+B2SIAV 
[178]  S2S+B2SSAV 

[179]  nu<-BmjA7 

[180]    SIU+BIUSAV 
V 
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